
STREPTOCOCCUS PNEUMONIAE PROTEINS AND NUCLEIC ACID MOLECULES 

The present invention relates to proteins derived from Streptococcus pneumoniae, 
nucleic acid molecules encoding such proteins, the use of the nucleic acid and/or 
proteins as antigens/immunogens and in detection/diagnosis, as well as methods for 
screening the proteins/nucleic acid sequences as potential anti-microbial targets.

Streptococcus pneumoniae, commonly referred to as the pneumococcus, is an
important pathogenic organism. The continuing significance of Streptococcus
pneumoniae infections in relation to human disease in developing and developed
countries has been authoritatively reviewed (Siber, G.R., Science, 265: 1385-1387
(1994)). That review indicates that on a global scale this organism is believed to be
the most common bacterial cause of acute respiratory infections, and is estimated to
result in 1 million childhood deaths each year, mostly in developing countries
(Stansfield, S.K., Pediatr. Infect. Dis., 6: 622 (1987)). In the USA it has been
suggested (Breiman et al, Arch. Intern. Med., 150: 1401 (1990)) that the
pneumococcus is still the most common cause of bacterial pneumonia, and that
disease rates are particularly high in young children, in the elderly, and in patients
with predisposing conditions such as asplenia, heart, lung and kidney disease,
diabetes, alcoholism, or with immunosuppressive disorders, especially AIDS. These
groups are at higher risk of pneumococcal septicaemia and hence meningitis and
therefore have a greater risk of dying from pneumococcal infection. The
pneumococcus is also the leading cause of otitis media and sinusitis, which remain
prevalent infections in children in developed countries, and which incur substantial
costs.

The need for effective preventative strategies against pneumococcal infection is 
highlighted by the recent emergence of penicillin-resistant pneumococci. It has been 
reported that 6.6% of pneumococcal isolates in 13 US hospitals in 12 states were found
to be resistant to penicillin and some isolates were also resistant to other antibiotics
including third generation cephalosporins (Schappert, S.M., Vital and Health Statistics
of the Centres for Disease Control/National Centre for Health Statistics, 214:1 
(1992)). The rates of penicillin resistance can be higher (up to 20%) in some 
hospitals (Breiman et al, J. Am. Med. Assoc., 271: 1831 (1994)). Since the 
development of penicillin resistance among pneumococci is both recent and sudden, 
coming after decades during which penicillin remained an effective treatment, these 
findings are regarded as alarming. 

For the reasons given above, there are therefore compelling grounds for considering 
improvements in the means of preventing, controlling, diagnosing or treating 
pneumococcal diseases. 

Various approaches have been taken in order to provide vaccines for the prevention 
of pneumococcal infections. Difficulties arise for instance in view of the variety of 
serotypes (at least 90) based on the structure of the polysaccharide capsule 
surrounding the organism. Vaccines against individual serotypes are not effective 
against other serotypes and this means that vaccines must include polysaccharide 
antigens from a whole range of serotypes in order to be effective in a majority of 
cases. An additional problem arises because it has been found that the capsular
polysaccharides (each of which determines the serotype and is the major protective 
antigen) when purified and used as a vaccine do not reliably induce protective 
antibody responses in children under two years of age, the age group which suffers 
the highest incidence of invasive pneumococcal infection and meningitis. 

A modification of the approach using capsule antigens relies on conjugating the 
polysaccharide to a protein in order to derive an enhanced immune response, 
particularly by giving the response T-cell dependent character. This approach has
been used in the development of a vaccine against Haemophilus influenzae. There
are issues of cost concerning both the multi-polysaccharide vaccines and those based
on conjugates.

A third approach is to look for other antigenic components which offer the potential
to be vaccine candidates. In the present application we provide a group of protein
antigens which are secreted/exported proteins.

Thus, in a first aspect the present invention provides a Streptococcus pneumoniae
protein or polypeptide having a sequence selected from those shown in Table 2
herein.

A protein or polypeptide of the present invention may be provided in substantially pure
form. For example, it may be provided in a form which is substantially free of other
proteins.

In a preferred embodiment, a protein or polypeptide having an amino acid sequence as 
shown in Table 3 is provided. 

The invention encompasses any protein coded for by a nucleic acid sequence as shown 
in Table 1 herein. 

As discussed herein, the proteins and polypeptides of the invention are useful as
antigenic material. Such material can be "antigenic" and/or "immunogenic".
Generally, "antigenic" is taken to mean that the protein or polypeptide is capable of
being used to raise antibodies or indeed is capable of inducing an antibody response in
a subject. "Immunogenic" is taken to mean that the protein or polypeptide is capable of
eliciting a protective immune response in a subject. Thus, in the latter case, the protein
or polypeptide may be capable of generating not only an antibody response but also
non-antibody based immune responses.

The skilled person will appreciate that homologues or derivatives of the proteins or
polypeptides of the invention will also find use in the context of the present invention,
i.e. as antigenic/immunogenic material. Thus, for instance, proteins or polypeptides
which include one or more additions, deletions, substitutions or the like are
encompassed by the present invention. In addition, it may be possible to replace one
amino acid with another of similar "type", for instance by replacing one hydrophobic
amino acid with another. One can use a program such as the CLUSTAL program to
compare amino acid sequences. This program compares amino acid sequences and
finds the optimal alignment by inserting spaces in either sequence as appropriate. It is
possible to calculate amino acid identity or similarity (identity plus conservation of
amino acid type) for an optimal alignment. A program like BLASTx will align the
longest stretch of similar sequences and assign a value to the fit. It is thus possible to
obtain a comparison where several regions of similarity are found, each having a
different score. Both types of analysis are contemplated in the present invention.
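
By way of illustration only, the identity and similarity figures discussed above can be
computed from two sequences that have already been aligned (for example by CLUSTAL).
The following is a minimal sketch. The grouping of amino acids into "types", the function
names, and the choice to count only gap-free alignment columns are assumptions made for
this example and are not taken from the specification.

```python
# Illustrative amino acid "types". This grouping is an assumption for the
# example; other groupings are equally defensible.
AA_GROUPS = [
    set("AVLIMFWC"),  # hydrophobic
    set("STNQGPY"),   # polar, uncharged
    set("KRH"),       # basic
    set("DE"),        # acidic
]


def same_type(a, b):
    """Return True if residues a and b fall in the same illustrative group."""
    return any(a in group and b in group for group in AA_GROUPS)


def identity_and_similarity(aligned_a, aligned_b):
    """Return (% identity, % similarity) for two pre-aligned sequences.

    Gaps are written as '-'. Columns containing a gap are skipped, so the
    percentages are relative to the number of gap-free columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = identical = conserved = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            identical += 1
        elif same_type(a, b):
            conserved += 1
    if compared == 0:
        return 0.0, 0.0
    identity = 100.0 * identical / compared
    # Similarity is identity plus conservation of amino acid type.
    similarity = 100.0 * (identical + conserved) / compared
    return identity, similarity


if __name__ == "__main__":
    # Hypothetical aligned fragments, used only to exercise the function.
    print(identity_and_similarity("MKV-LA", "MRIGLA"))
```

A homologue could then be compared against a chosen similarity threshold using the second
value returned.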

In the case of homologues and derivatives, the degree of identity with a protein or
polypeptide as described herein is less important than that the homologue or derivative
should retain its antigenicity or immunogenicity to Streptococcus pneumoniae.
However, suitably, homologues or derivatives having at least 60% similarity (as
discussed above) with the proteins or polypeptides described herein are provided.
Preferably, homologues or derivatives having at least 70% similarity, more preferably
at least 80% similarity are provided. Most preferably, homologues or derivatives
having at least 90% or even 95% similarity are provided.

In an alternative approach, the homologues or derivatives could be fusion proteins,
incorporating moieties which render purification easier, for example by effectively 
tagging the desired protein or polypeptide. It may be necessary to remove the "tag" or 
it may be the case that the fusion protein itself retains sufficient antigenicity to be 
useful. 

In an additional aspect of the invention there are provided antigenic fragments of the 
proteins or polypeptides of the invention, or of homologues or derivatives thereof. 

For fragments of the proteins or polypeptides described herein, or of homologues or
derivatives thereof, the situation is slightly different. It is well known that it is possible to
screen an antigenic protein or polypeptide to identify epitopic regions, i.e. those regions
which are responsible for the protein or polypeptide's antigenicity or immunogenicity.
Methods for carrying out such screening are well known in the art. Thus, the fragments
of the present invention should include one or more such epitopic regions or be
sufficiently similar to such regions to retain their antigenic/immunogenic properties.
Thus, for fragments according to the present invention the degree of identity is perhaps
irrelevant, since they may be 100% identical to a particular part of a protein or
polypeptide, homologue or derivative as described herein. The key issue, once again, is
that the fragment retains the antigenic/immunogenic properties.

Thus, what is important for homologues, derivatives and fragments is that they possess 
at least a degree of the antigenicity/immunogenicity of the protein or polypeptide from 
which they are derived.

Gene cloning techniques may be used to provide a protein of the invention in
substantially pure form. These techniques are disclosed, for example, in J. Sambrook
et al, Molecular Cloning, 2nd Edition, Cold Spring Harbor Laboratory Press (1989).

Thus, in a fourth aspect, the present invention provides a nucleic acid molecule
comprising or consisting of a sequence which is:

(i) any of the DNA sequences set out in Table 1 or their RNA equivalents;

(ii) a sequence which is complementary to any of the sequences of (i);

(iii) a sequence which codes for the same protein or polypeptide as those
sequences of (i) or (ii);

(iv) a sequence which has substantial identity with any of those of (i), (ii)
and (iii);

(v) a sequence which codes for a homologue, derivative or fragment of a
protein as defined in Table 1.

In a fifth aspect the present invention provides a nucleic acid molecule comprising or
consisting of a sequence which is:

(i) any of the DNA sequences set out in Table 4 or their RNA equivalents;

(ii) a sequence which is complementary to any of the sequences of (i);

(iii) a sequence which codes for the same protein or polypeptide as those
sequences of (i) or (ii);

(iv) a sequence which has substantial identity with any of those of (i), (ii)
and (iii);

(v) a sequence which codes for a homologue, derivative or fragment of a protein
as defined in Table 4.

The nucleic acid molecules of the invention may include a plurality of such sequences, 
and/or fragments. The skilled person will appreciate that the present invention can 
include novel variants of those particular novel nucleic acid molecules which are 
exemplified herein. Such variants are encompassed by the present invention. These
may occur in nature, for example because of strain variation. For example, additions, 
substitutions and/or deletions are included. In addition, and particularly when utilising 
microbial expression systems, one may wish to engineer the nucleic acid sequence by 
making use of known preferred codon usage in the particular organism being used for 
expression. Thus, synthetic or non-naturally occurring variants are also included within 
the scope of the invention. 

The term "RNA equivalent" when used above indicates that a given RNA molecule has 
a sequence which is complementary to that of a given DNA molecule (allowing for the 
fact that in RNA "U" replaces "T" in the genetic code). 
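
As an illustration of the definition just given, the following minimal sketch derives the
complementary RNA sequence from a DNA sequence, with "U" used in place of "T". The
function name and the convention of writing the result 5' to 3' are assumptions made for
this example.

```python
# Map each DNA base to its complementary RNA base (A->U, C->G, G->C, T->A).
_DNA_TO_RNA_COMPLEMENT = str.maketrans("ACGT", "UGCA")


def rna_complement(dna):
    """Return the RNA strand complementary to `dna`, written 5' to 3'."""
    dna = dna.upper()
    if set(dna) - set("ACGT"):
        raise ValueError("expected only A, C, G and T")
    # Complement each base, then reverse so the result reads 5' to 3'.
    return dna.translate(_DNA_TO_RNA_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Hypothetical four-base input, used only to exercise the function.
    print(rna_complement("ATGC"))
```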

When comparing nucleic acid sequences for the purposes of determining the degree of 
homology or identity one can use programs such as BESTFIT and GAP (both from the 
Wisconsin Genetics Computer Group (GCG) software package). BESTFIT, for
example, compares two sequences and produces an optimal alignment of the most
similar segments. GAP enables sequences to be aligned along their whole length and
finds the optimal alignment by inserting spaces in either sequence as appropriate.
Suitably, in the context of the present invention, when discussing identity of
nucleic acid sequences, the comparison is made by alignment of the sequences along
their whole length.

Preferably, sequences which have substantial identity have at least 50% sequence
identity, desirably at least 75% sequence identity and more desirably at least 90% or at
least 95% sequence identity with said sequences. In some cases the sequence identity
may be 99% or above.
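
Purely as an illustration of comparing sequences along their whole length, the sketch
below performs a simple global (Needleman-Wunsch style) alignment and reports percent
identity. It is not the GAP program: the scoring values (match +1, mismatch -1, gap -1),
the function names, and the use of the full alignment length as the denominator are all
assumptions made for this example.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align a and b; return the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Percent of alignment columns that hold identical bases."""
    al_a, al_b = global_align(a.upper(), b.upper())
    if not al_a:
        return 0.0
    same = sum(x == y and x != "-" for x, y in zip(al_a, al_b))
    return 100.0 * same / len(al_a)


def has_substantial_identity(a, b, threshold=50.0):
    """Apply an identity threshold; 50% is the lowest figure given in the text."""
    return percent_identity(a, b) >= threshold


if __name__ == "__main__":
    # Hypothetical short sequences, used only to exercise the functions.
    print(global_align("ACGT", "ACT"))
    print(percent_identity("ACGT", "ACT"))
```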

Desirably, the term "substantial identity" indicates that said sequence has a greater
degree of identity with any of the sequences described herein than with prior art nucleic 
acid sequences. 

It should however be noted that where a nucleic acid sequence of the present invention 
codes for at least part of a novel gene product the present invention includes within its 
scope all possible sequences coding for the gene product or for a novel part thereof.

The nucleic acid molecule may be in isolated or recombinant form. It may be
incorporated into a vector and the vector may be incorporated into a host. Such vectors 
and suitable hosts form yet further aspects of the present invention. 

Therefore, for example, by using probes based upon the nucleic acid sequences 
provided herein, genes in Streptococcus pneumoniae can be identified. They can then
be excised using restriction enzymes and cloned into a vector. The vector can be
introduced into a suitable host for expression.

Nucleic acid molecules of the present invention may be obtained from S. pneumoniae by
the use of appropriate probes complementary to part of the sequences of the nucleic 
acid molecules. Restriction enzymes or sonication techniques can be used to obtain 
appropriately sized fragments for probing. 

Alternatively PCR techniques may be used to amplify a desired nucleic acid sequence. 
Thus the sequence data provided herein can be used to design two primers for use in 
PCR so that a desired sequence, including whole genes or fragments thereof, can be 
targeted and then amplified to a high degree. One primer will normally show a high 
degree of specificity for a first sequence located on one strand of a DNA molecule, and 
the other primer will normally show a high degree of specificity for a second sequence 
located on the complementary strand of the DNA sequence and being spaced from the 
complementary sequence to the first sequence. 

Typically primers will be at least 15-25 nucleotides long. 
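
The two-primer arrangement described above can be sketched as follows. This is a
minimal illustration only: the function names, the default primer length of 20 (chosen from
within the 15-25 nucleotide range mentioned) and the example template are assumptions,
and no check of melting temperature or specificity is attempted.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def design_primer_pair(template, start, end, length=20):
    """Return (forward, reverse) primers flanking template[start:end].

    `template` is one strand of the DNA written 5' to 3'. The forward primer
    is identical to the first `length` bases of the target on that strand. The
    reverse primer is the reverse complement of the last `length` bases, so it
    matches the complementary strand at the far end of the target.
    """
    target = template[start:end].upper()
    if len(target) < 2 * length:
        raise ValueError("target is too short for two non-overlapping primers")
    forward = target[:length]
    reverse = reverse_complement(target[-length:])
    return forward, reverse


if __name__ == "__main__":
    # Hypothetical 18-base template, used only to exercise the function.
    print(design_primer_pair("ATGAAACCCGGGTTTTAG", 0, 18, length=5))
```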

As a further alternative chemical synthesis may be used. This may be automated. 
Relatively short sequences may be chemically synthesised and ligated together to 
provide a longer sequence. 

In yet a further aspect the present invention provides an immunogenic/antigenic 
composition comprising one or more proteins or polypeptides selected from those 
whose sequences are shown in Tables 2-4, or homologues or derivatives thereof, 
and/or fragments of any of these. In preferred embodiments, the 
immunogenic/antigenic composition is a vaccine or is for use in a diagnostic assay. 

In the case of vaccines suitable additional excipients, diluents, adjuvants or the like 
may be included. Numerous examples of these are well known in the art.

It is also possible to utilise the nucleic acid sequences shown in Table 1 in the
preparation of so-called DNA vaccines. Thus, the invention also provides a vaccine
composition comprising one or more nucleic acid sequences as defined herein. The
use of such DNA vaccines is described in the art. See for instance, Donnelly et al,
Ann. Rev. Immunol., 15:617-648 (1997).

As already discussed herein the proteins or polypeptides described herein, their
homologues or derivatives, and/or fragments of any of these, can be used in methods
of detecting/diagnosing S. pneumoniae. Such methods can be based on the detection
of antibodies against such proteins which may be present in a subject. Therefore the
present invention provides a method for the detection/diagnosis of S. pneumoniae
which comprises the step of bringing into contact a sample to be tested with at least
one protein, or homologue, derivative or fragment thereof, as described herein.
Suitably, the sample is a biological sample, such as a tissue sample or a sample of
blood or saliva obtained from a subject to be tested.

In an alternative approach, the proteins described herein, or homologues, derivatives 
and/or fragments thereof, can be used to raise antibodies, which in turn can be used 
to detect the antigens, and hence S. pneumoniae. Such antibodies form another aspect
of the invention. Antibodies within the scope of the present invention may be 
monoclonal or polyclonal. 

Polyclonal antibodies can be raised by stimulating their production in a suitable animal
host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as
described herein, or a homologue, derivative or fragment thereof, is injected into the
animal. If desired, an adjuvant may be administered together with the protein.
Well-known adjuvants include Freund's adjuvant (complete and incomplete) and aluminium
hydroxide. The antibodies can then be purified by virtue of their binding to a protein as
described herein.

Monoclonal antibodies can be produced from hybridomas. These can be formed by
fusing myeloma cells and spleen cells which produce the desired antibody in order to
form an immortal cell line. Thus the well-known Kohler & Milstein technique (Nature
256 (1975)) or subsequent variations upon this technique can be used.

Techniques for producing monoclonal and polyclonal antibodies that bind to a 
particular polypeptide/protein are now well developed in the art. They are discussed in 
standard immunology textbooks, for example in Roitt et al. Immunology second edition 
(1989), Churchill Livingstone, London. 

In addition to whole antibodies, the present invention includes derivatives thereof which 
are capable of binding to proteins etc as described herein. Thus the present invention 
includes antibody fragments and synthetic constructs. Examples of antibody fragments 
and synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September 
1994). 

Antibody fragments include, for example, Fab, F(ab')2 and Fv fragments. (These are
discussed in Roitt et al [supra].) Fv fragments can be modified
to produce a synthetic construct known as a single chain Fv (scFv) molecule. This
includes a peptide linker covalently joining VH and VL regions, which contributes to the
stability of the molecule. Other synthetic constructs that can be used include CDR
peptides. These are synthetic peptides comprising antigen-binding determinants.
Peptide mimetics may also be used. These molecules are usually conformationally
restricted organic rings that mimic the structure of a CDR loop and that include
antigen-interactive side chains.

Synthetic constructs include chimaeric molecules. Thus, for example, humanised (or
primatised) antibodies or derivatives thereof are within the scope of the present
invention. An example of a humanised antibody is an antibody having human
framework regions, but rodent hypervariable regions. Ways of producing chimaeric
antibodies are discussed for example by Morrison et al in PNAS, 81, 6851-6855 (1984)
and by Takeda et al in Nature, 314, 452-454 (1985).

Synthetic constructs also include molecules comprising an additional moiety that 
provides the molecule with some desirable property in addition to antigen binding. For
example the moiety may be a label (e.g. a fluorescent or radioactive label). 
Alternatively, it may be a pharmaceutically active agent. 

Antibodies, or derivatives thereof, find use in detection/diagnosis of S. pneumoniae. 
Thus, in another aspect the present invention provides a method for the
detection/diagnosis of S. pneumoniae which comprises the step of bringing into contact 
a sample to be tested and antibodies capable of binding to one or more proteins 
described herein, or to homologues, derivatives and/or fragments thereof. 

In addition, so-called "Affibodies" may be utilised. These are binding proteins
selected from combinatorial libraries of an alpha-helical bacterial receptor domain
(Nord et al). Thus, small protein domains capable of specific binding to different
target proteins can be selected using combinatorial approaches.

It will also be clear that the nucleic acid sequences described herein may be used to 
detect/diagnose S. pneumoniae. Thus, in yet a further aspect, the present invention 
provides a method for the detection/diagnosis of S. pneumoniae which comprises the
step of bringing into contact a sample to be tested with at least one nucleic acid
sequence as described herein. Suitably, the sample is a biological sample, such as a
tissue sample or a sample of blood or saliva obtained from a subject to be tested.
Such samples may be pre-treated before being used in the methods of the invention.
Thus, for example, a sample may be treated to extract DNA. Then, DNA probes
based on the nucleic acid sequences described herein (i.e. usually fragments of such
sequences) may be used to detect nucleic acid from S. pneumoniae.

In additional aspects, the present invention provides: 

(a) a method of vaccinating a subject against S. pneumoniae which comprises the
step of administering to a subject a protein or polypeptide of the invention, or a
derivative, homologue or fragment thereof, or an immunogenic composition of the
invention;

(b) a method of vaccinating a subject against S. pneumoniae which comprises the
step of administering to a subject a nucleic acid molecule as defined herein;

(c) a method for the prophylaxis or treatment of S. pneumoniae infection which
comprises the step of administering to a subject a protein or polypeptide of the
invention, or a derivative, homologue or fragment thereof, or an immunogenic
composition of the invention;

(d) a method for the prophylaxis or treatment of S. pneumoniae infection which
comprises the step of administering to a subject a nucleic acid molecule as defined
herein;

(e) a kit for use in detecting/diagnosing S. pneumoniae infection comprising one
or more proteins or polypeptides of the invention, or homologues, derivatives or
fragments thereof, or an antigenic composition of the invention; and

(f) a kit for use in detecting/diagnosing S. pneumoniae infection comprising one
or more nucleic acid molecules as defined herein.

Given that we have identified a group of important proteins, such proteins are 
potential targets for anti-microbial therapy. It is necessary, however, to determine 
whether each individual protein is essential for the organism's viability. Thus, the 
present invention also provides a method of determining whether a protein or
polypeptide as described herein represents a potential anti-microbial target which
comprises inactivating said protein and determining whether S. pneumoniae is still
viable, in vitro or in vivo.

A suitable method for inactivating the protein is to effect selected gene knockouts, i.e.
to prevent expression of the protein and determine whether this results in a lethal
change. Suitable methods for carrying out such gene knockouts are described in Li et
al, P.N.A.S., 94:13251-13256 (1997).

In a final aspect the present invention provides the use of an agent capable of
antagonising, inhibiting or otherwise interfering with the function or expression of a
protein or polypeptide of the invention in the manufacture of a medicament for use in
the treatment or prophylaxis of S. pneumoniae infection.

The invention will now be described with reference to the following examples,
which should not be construed as in any way limiting the invention. The examples 
refer to the figures in which:

Figure 1: shows the results of various DNA vaccine trials; and
Figure 2: shows the results of further DNA vaccine trials.

EXAMPLE 1 

The genome sequencing of Streptococcus pneumoniae type 4 is in progress at the
Institute for Genomic Research (TIGR, Rockville, MD, USA). Up to now, the
whole sequence has not been completed or published. On 21st November 1997, the
TIGR centre released some DNA sequences as contigs which are not accurate
reflections of the finished sequence. These contigs can be downloaded from their
website (www.tigr.org). We downloaded these contigs and created a local database
using the application GCGToBLAST (Wisconsin Package Version 9.1, Genetics
Computer Group (GCG), Madison, USA). This database can be searched with the
FastA and TfastA procedures (using the method of Pearson and Lipman, PNAS USA,
85:2444-2448 (1988)).

Using FastA and TfastA procedures, the local pneumococcus database was searched 
for putative leader sequence or anchor sequence features. Relevant sequences were 
used to interrogate for comparative novel sequences. These were: 

(i) already described leader sequences of Streptococcus pneumoniae (from 
proteins NanA, NanB, LytA, PapA, pcpA, PsaA and PspA); 

(ii) the leader sequence of Usp45, a secreted protein from Lactococcus lactis; 

(iii) new hypothetical leader sequences derived from the searches in (i) and
(ii);

(iv) the anchor motif LPxTG, a feature common to many Gram-positive
bacterial surface proteins which are anchored by a mechanism involving the
Sortase complex proteins.
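
The searches themselves were carried out with FastA and TfastA. Purely to illustrate
what screening for the LPxTG anchor motif of item (iv) involves, the following minimal
sketch scans a translated protein sequence with a regular expression in which "x" stands
for any residue. The function name and the example sequences are hypothetical.

```python
import re

# L, P, any one residue, T, G.
LPXTG = re.compile(r"LP.TG")


def find_lpxtg(protein):
    """Return (position, matched text) for each LPxTG motif in `protein`.

    Positions are 0-based offsets into the sequence.
    """
    return [(m.start(), m.group()) for m in LPXTG.finditer(protein.upper())]


if __name__ == "__main__":
    # Hypothetical peptide fragments, used only to exercise the function.
    print(find_lpxtg("MKKLLPETGSAA"))
    print(find_lpxtg("MKKLLAETGSAA"))
```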

Provided below is an example of this approach, with reference to the sequences 
derived from the database (see Table 1).

[Diagram: search scheme starting from the known leader sequences Usp45, NanA, PapA,
PsaA, PspA and PrtP and leading to the database sequences 4121.1, 4101.3, 4172.4,
4106.1, 4346.1, 4194.1, 4172.2, 4147.2, 4191.6, 4170.8, 4175.3, 4137.3, 4193.1,
4122.1, 4187.2, 4188.1, 4102.1, 4136.1, 4127.1, 4191.1, 4119.1 and 4119.2.]

The protein leader sequences of different known exported proteins were used as a 
starting point for a search of the local pneumococcus database described above. The 
hypothetical proteins found with this search were then submitted to a Blast search in 
general databases such as EMBL, Swissprot etc. Proteins remaining unknown in the 
pneumococcus are kept and annotated. Then the search is performed again using the 
new potential protein leader sequence as a probe, using the TfastA procedure.

Example 2: DNA vaccine trials

pcDNA3.1+ as a DNA vaccine vector

The vector chosen for use as a DNA vaccine vector was pcDNA3.1 (Invitrogen)
(actually pcDNA3.1+; the forward orientation was used in all cases, but the vector may
be referred to as pcDNA3.1 from here on). This vector has been widely and successfully
employed as a host vector to test vaccine candidate genes to give protection against
pathogens in the literature (Zhang et al., Kurar and Splitter, Anderson et al.). The
vector was designed for high-level stable and non-replicative transient expression in
mammalian cells. pcDNA3.1 contains the ColE1 origin of replication which allows
convenient high-copy number replication and growth in E. coli. This in turn allows
rapid and efficient cloning and testing of many genes. The pcDNA3.1 vector has a
large number of cloning sites and also contains the gene encoding ampicillin
resistance to aid in cloning selection and the human cytomegalovirus (CMV)
immediate-early promoter/enhancer which permits efficient, high-level expression of
the recombinant protein. The CMV promoter is a strong viral promoter in a wide
range of cell types including both muscle and immune (antigen presenting) cells.
This is important for optimal immune response as it remains unknown as to which
cell types are most important in generating a protective response in vivo. A T7
promoter upstream of the multiple cloning site affords efficient expression of the
modified insert of interest and allows in vitro transcription of a cloned gene in
the sense orientation.

Zhang, D., Yang, X., Berry, J., Shen, C., McClarty, G. and Brunham, R.C. (1997)
"DNA vaccination with the major outer-membrane protein genes induces acquired
immunity to Chlamydia trachomatis (mouse pneumonitis) infection". Infection and
Immunity, 176, 1035-40.

Kurar, E. and Splitter, G.A. (1997) "Nucleic acid vaccination of Brucella abortus
ribosomal L7/L12 gene elicits immune response". Vaccine, 15, 1851-57.

Anderson, R., Gao, X.-M., Papakonstantinopoulou, A., Roberts, M. and Dougan,
G. (1996) "Immune response in mice following immunisation with DNA encoding
fragment C of tetanus toxin". Infection and Immunity, 64, 3168-3173.

Preparation of DNA vaccines 

Oligonucleotide primers were designed for each individual gene of interest derived
using the LEEP system. Each gene was examined thoroughly, and where possible,
primers were designed such that they targeted that portion of the gene thought to
encode only the mature portion of the gene protein. It was hoped that expressing
those sequences that encode only the mature portion of a target gene protein would
facilitate its correct folding when expressed in mammalian cells. For example, in the
majority of cases primers were designed such that putative N-terminal signal peptide
sequences would not be included in the final amplification product to be cloned into
the pcDNA3.1 expression vector. The signal peptide directs the polypeptide
precursor to the cell membrane via the protein export pathway where it is normally
cleaved off by signal peptidase I (or signal peptidase II if a lipoprotein). Hence the
signal peptide does not make up any part of the mature protein whether it be
displayed on the surface of the bacteria or secreted. Where an N-terminal
leader peptide sequence was not immediately obvious, primers were designed to
target the whole of the gene sequence for cloning and ultimately, expression in
pcDNA3.1.

Having said that, however, other additional features of proteins may also affect the
expression and presentation of a soluble protein. DNA sequences encoding such
features in the genes encoding the proteins of interest were excluded during the
design of oligonucleotides. These features included:

1. LPXTG cell wall anchoring motifs.

2. LXXC lipoprotein attachment sites.

3. Hydrophobic C-terminal domain.

4. Where no N-terminal signal peptide or LXXC was present the start codon was
excluded.

5. Where no hydrophobic C-terminal domain or LPXTG motif was present the stop
codon was removed.

Appropriate PCR primers were designed for each gene of interest, and any and all of
the regions encoding the above features were removed from the gene when designing
these primers. The primers were designed with the appropriate restriction enzyme
site followed by a conserved Kozak nucleotide sequence (in all cases GCCACC was
used; the Kozak sequence facilitates the recognition of initiator sequences by
eukaryotic ribosomes) and an ATG start codon upstream of the insert of the gene of
interest. For example, a forward primer using a BamHI site would
begin GCGGGATCCGCCACCATG, followed by a small section of the 5' end of the
gene of interest. The reverse primer was designed to be compatible with the forward
primer and with a NotI restriction site at the 5' end in all cases (this site is
TTGCGGCCGC).
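The primer layout described above can be sketched in Python as follows. The two 5' prefixes are the ones quoted in the text; the function names, the annealing length and the toy insert are hypothetical, and taking the reverse complement of the 3' end of the insert for the reverse primer is an assumption based on standard PCR practice rather than a detail stated here.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Prefixes quoted in the text: BamHI site + Kozak sequence + ATG, and the NotI tail.
FORWARD_PREFIX = "GCGGGATCCGCCACCATG"
NOTI_PREFIX = "TTGCGGCCGC"


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(insert, anneal_len=20):
    """Build a (forward, reverse) primer pair, both written 5' to 3'.

    insert is the coding region to be amplified, already trimmed of the
    excluded features; anneal_len is a hypothetical annealing length.
    """
    insert = insert.upper()
    forward = FORWARD_PREFIX + insert[:anneal_len]
    reverse = NOTI_PREFIX + reverse_complement(insert[-anneal_len:])
    return forward, reverse


# Toy insert, invented for illustration only.
forward, reverse = design_primers("AAACCCGGGTTTACGT", anneal_len=6)
print(forward)
print(reverse)
```

In practice the length of the gene-specific section would be chosen per gene, for example to balance the melting temperatures of the two primers.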

PCR primers 






WO 00/06737 






PCT/GB99/02451 



The following PCR primers were designed and used to amplify the truncated genes 
of interest. 

ID210

Forward Primer 5' CGGATCCGCCACCATGTCTTCTAATGAATCTGCCGATG 3'
Reverse Primer 5' TTGCGGCCGCCTGTTTAGATTGGATATCTGTAAAGACTT 3'

4172.5

Forward Primer 5' CGCGGATCCGCCACCATGGATTTTCCTTCAAATTTGGAGG 3'
Reverse Primer 5' TTGCGGCCGCACCGTACTGGCTGCTGACT 3'

ID211

Forward Primer 5' CGGATCCGCCACCATGAGTGAGATCAAAATTATTAACGC 3'
Reverse Primer 5' TTGCGGCCGCCGTTCCATGGTTGACTCCT 3'

4197.4

Forward Primer 5' CGCGGATCCGCCACCATGTGGGACATATTGGTGGAAAC 3'
Reverse Primer 5' TTGCGGCCGCTTCACTTGAGCAAACTGAATCC 3'

4122.1

Forward Primer 5' CGCGGATCCGCCACCATGTCACAAGAAAAAACAAAAAATGAA 3'
Reverse Primer 5' TTGCGGCCGCATCGACGTAGTCTCCGCC 3'

4126.7

Forward Primer 5' CGCGGATCCGCCACCATGCTGGTTGGAACTTTCTACTATCAAT 3'
Reverse Primer 5' TTGCGGCCGCAACTTTCGTCCCTTTTTGG 3'

4188.11

Forward Primer 5' CGCGGATCCGCCACCATGGGCAATTCTGGCGGAA 3'
Reverse Primer 5' TTGCGGCCGCTTGTTTCATAGCTTTTTTGATTGTT 3'

ID209

Forward Primer 5' CGCGGATCCGCCACCATGCTATTGATACGAAATGCAGGG 3'
Reverse Primer 5' TTGCGGCCGCAACATAATCTAGTAAATAAGCGTAGCC 3'

ID215

Forward Primer 5' CGCGGATCCGCCACCATGACGGCGACGAATTTTC 3'
Reverse Primer 5' TTGCGGCCGCTTAATTCGTTTTTGAACTAGTTGCT 3'

4170.4

Forward Primer 5' CGCGGATCCGCCACCATGGCTGTTTTTCTTCGCTATCATG 3'
Reverse Primer 5' TTGCGGCCGCTTTCTTCAACAAACCTTGTTCTTG 3'

4193.1

Forward Primer 5' CGCGGATCCGCCACCATGGGTAACCGCTCTTCTCGTAAC 3'
Reverse Primer 5' TTGCGGCCGCGCTTCCATCAAGGATTTTAGC 3'
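The shared features of the listed primers can be checked mechanically. The Python sketch below takes three of the primer pairs from the list above (with spacing removed) and confirms that each forward primer contains the BamHI site followed by the Kozak sequence and ATG, and that each reverse primer begins with the NotI tail. The function and variable names are hypothetical.

```python
BAMHI_KOZAK_ATG = "GGATCC" + "GCCACC" + "ATG"  # BamHI site, Kozak sequence, start codon
NOTI_TAIL = "TTGCGGCCGC"

# Three primer pairs copied from the list above.
PRIMERS = {
    "ID210": ("CGGATCCGCCACCATGTCTTCTAATGAATCTGCCGATG",
              "TTGCGGCCGCCTGTTTAGATTGGATATCTGTAAAGACTT"),
    "4197.4": ("CGCGGATCCGCCACCATGTGGGACATATTGGTGGAAAC",
               "TTGCGGCCGCTTCACTTGAGCAAACTGAATCC"),
    "4188.11": ("CGCGGATCCGCCACCATGGGCAATTCTGGCGGAA",
                "TTGCGGCCGCTTGTTTCATAGCTTTTTTGATTGTT"),
}


def check_pair(forward, reverse):
    """Report whether a primer pair carries the expected flanking features."""
    site = forward.find(BAMHI_KOZAK_ATG)
    gene_part = forward[site + len(BAMHI_KOZAK_ATG):] if site != -1 else ""
    return {
        "forward_has_bamhi_kozak_atg": site != -1,
        "gene_specific_5prime": gene_part,
        "reverse_has_noti": reverse.startswith(NOTI_TAIL),
    }


report = {name: check_pair(*pair) for name, pair in PRIMERS.items()}
for name, result in report.items():
    print(name, result)
```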

Cloning 

The insert, along with the flanking features described above, was amplified using PCR
against a template of genomic DNA isolated from type 4 S. pneumoniae strain NCTC 11886
obtained from the National Collection of Type Cultures. The PCR product was cut
with the appropriate restriction enzymes and cloned into the multiple cloning site of
pcDNA3.1 using conventional molecular biological techniques. Suitably mapped
clones of the genes of interest were cultured and the plasmids isolated on a large
scale (> 1.5 mg) using Plasmid Mega Kits (Qiagen). Successful cloning and
maintenance of genes was confirmed by restriction mapping and by sequencing ~700
base pairs through the 5' cloning junction of each large-scale preparation of each
construct.

Strain validation 
The type 4 strain used in the cloning and challenge methods is the strain
from which the S. pneumoniae genome was sequenced. A freeze-dried ampoule of a
homogeneous laboratory strain of type 4 S. pneumoniae, strain NCTC 11886, was
obtained from the National Collection of Type Cultures. The ampoule was opened and
the culture resuspended in 0.5 ml of tryptic soy broth (0.5% glucose, 5%
blood). The suspension was subcultured into 10 ml tryptic soy broth (0.5% glucose,
5% blood) and incubated statically overnight at 37°C. This culture was streaked on
to 5% blood agar plates, to check for contaminants and confirm viability, and on to
blood agar slopes; the rest of the culture was used to make 20% glycerol stocks.
The slopes were sent to the Public Health Laboratory Service, where the type 4
serotype was confirmed.

A glycerol stock of NCTC 11886 was streaked on a 5% blood agar plate and
incubated overnight in a CO2 gas jar at 37°C. Fresh streaks were made and optochin
sensitivity was confirmed.

Pneumococcal challenge 

A standard inoculum of type 4 S. pneumoniae was prepared and frozen down by
passaging a culture of pneumococcus 1x through mice, harvesting from the blood of
infected animals, and growing up to a predetermined viable count of around 10^9
cfu/ml in broth before freezing down. The preparation is set out in the flow chart
below.



Streak pneumococcal culture and confirm identity
                    |
                    V
Grow over-night culture from 4-5 colonies on plate above
                    |
                    V
Animal passage pneumococcal culture
(i.p. injection; cardiac bleed to harvest)
                    |
                    V
Grow over-night culture from animal passaged pneumococcus
                    |
                    V
Grow day culture (to pre-determined optical density) from over-night of animal
passage and freeze down at -70°C - This is standard inoculum
                    |
                    V
Thaw one aliquot of standard inoculum to viable count
                    |
                    V
Use standard inoculum to determine effective dose (called Virulence Testing)
                    |
                    V
All subsequent challenges - use standard inoculum to effective dose

An aliquot of standard inoculum was diluted 500x in PBS and used to inoculate the 
mice. 

Mice were lightly anaesthetised using halothane and then a dose of 1.4 x 10^5 cfu of
pneumococcus was applied to the nose of each mouse. The uptake was facilitated by 
the normal breathing of the mouse, which was left to recover on its back. 
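The arithmetic linking the standard inoculum, the 500x dilution and the challenge dose can be sketched as below. The stock concentration of exactly 10^9 cfu/ml is an assumption (the text gives only "around 10^9 cfu/ml"), so the volume computed is purely illustrative and is not a figure stated in this document. The function name is hypothetical.

```python
def challenge_volume_ul(stock_cfu_per_ml, dilution_factor, dose_cfu):
    """Return (diluted concentration in cfu/ml, volume in microlitres holding dose_cfu)."""
    diluted_cfu_per_ml = stock_cfu_per_ml / dilution_factor
    volume_ul = dose_cfu / diluted_cfu_per_ml * 1000.0  # 1 ml = 1000 microlitres
    return diluted_cfu_per_ml, volume_ul


# Assumed stock of exactly 1e9 cfu/ml; dilution and dose are taken from the text.
diluted, volume_ul = challenge_volume_ul(1e9, 500, 1.4e5)
print(f"diluted inoculum: {diluted:.2e} cfu/ml")
print(f"volume per mouse: {volume_ul:.1f} microlitres")
```

Because the true viable count of each thawed aliquot varies, the dose actually delivered would be confirmed by plating, as described for the vaccine trials.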

S. pneumoniae vaccine trials 

Vaccine trials in mice were carried out by the administration of DNA to 6-week-old
CBA/ca mice (Harlan, UK). Mice to be vaccinated were divided into groups of six
and each group was immunised with recombinant pcDNA3.1+ plasmid DNA
containing a specific target-gene sequence of interest. A total of 100 µg of DNA in
Dulbecco's PBS (Sigma) was injected intramuscularly into the tibialis anterior
muscle of both legs (50 µl in each leg). A boost was carried out using the same
procedure 4 weeks later. For comparison, control groups were included in all
vaccine trials. These control groups were either unvaccinated animals or those
administered non-recombinant pcDNA3.1+ DNA only (sham vaccinated),
using the same time course described above. Three weeks after the second immunisation,
all groups of mice were challenged intranasally with a lethal dose of S. pneumoniae
serotype 4 (strain NCTC 11886). The number of bacteria administered was
monitored by plating serial dilutions of the inoculum on 5% blood agar plates. A
problem with intranasal inoculations is that in some mice the inoculum bubbles out
of the nostrils; this has been noted in the results table and taken account of in
calculations. A less obvious problem is that a certain amount of the inoculum for
each mouse may be swallowed. It is assumed that this amount will be the same for
each mouse and will average out over the course of inoculations. However, the
sample sizes that have been used are small and this problem may have significant
effects in some experiments. All mice remaining after the challenge were killed 3 or
4 days after infection. During the infection process, challenged mice were monitored
for the development of symptoms associated with the onset of S. pneumoniae
induced disease. Typical symptoms, in an appropriate order, included piloerection,
an increasingly hunched posture, discharge from eyes, increased lethargy and
reluctance to move. The latter symptoms usually coincided with the development of
a moribund state, at which stage the mice were culled to prevent further suffering.
These mice were deemed to be very close to death, and the time of culling was used
to determine a survival time for statistical analysis. Where mice were found dead,
the survival time was taken as the last time point when the mouse was monitored
alive.
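The rule for assigning survival times can be expressed as a short Python sketch. The record structure and names are hypothetical, as is the use of hours as the unit. Assigning the end-of-trial time to mice that survived until the trial was terminated is an assumption made here for illustration and is not stated in the text.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class MouseRecord:
    # Times (hours after challenge) at which the mouse was observed alive.
    observations_h: List[float]
    # Time of culling if the mouse was culled in a moribund state.
    culled_at_h: Optional[float] = None
    # True if the mouse was found dead at a later check.
    found_dead: bool = False


def survival_time(record, trial_end_h):
    """Survival time in hours following the rules described in the text."""
    if record.culled_at_h is not None:
        return record.culled_at_h          # culled when moribund
    if record.found_dead:
        return max(record.observations_h)  # last time point monitored alive
    return trial_end_h                     # assumption: survivor to end of trial


# Hypothetical records, invented for illustration only.
culled = MouseRecord([24.0, 48.0], culled_at_h=52.0)
dead = MouseRecord([24.0, 48.0], found_dead=True)
survivor = MouseRecord([24.0, 48.0, 72.0])
times = [survival_time(m, trial_end_h=96.0) for m in (culled, dead, survivor)]
print(times)
```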


Interpretation of Results 

A positive result was taken as any DNA sequence that was cloned and used in
challenge experiments as described above and which gave protection against that
challenge. Protection was taken to include those DNA sequences that gave statistically
significant protection (to a 95% confidence level, p<0.05), those which were
marginal or close to significant using the Mann-Whitney test, and those which showed
some protective features, for example where there were one or more outlying mice or
where the time to the first death was prolonged. It is acceptable to allow marginal or
non-significant results to be considered as potential positives when it is considered
that the clarity of some of the results may be clouded by the problems associated with
the administration of intranasal infections.
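For groups as small as six mice, a Mann-Whitney comparison of survival times can be computed exactly by enumerating every way of splitting the pooled observations into two groups. The Python sketch below illustrates this under stated assumptions: the survival times are hypothetical values invented for the example, not results from these trials, and the two-sided exact permutation form shown is one common variant of the test, not necessarily the one used here.

```python
from itertools import combinations


def mann_whitney_u(x, y):
    """U statistic for x: number of (x, y) pairs with x > y, ties counting 0.5."""
    return sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in x for b in y)


def exact_two_sided_p(x, y):
    """Exact two-sided p-value obtained by enumerating all group assignments."""
    pooled = list(x) + list(y)
    n, total_n = len(x), len(x) + len(y)
    mean_u = n * len(y) / 2.0
    observed = abs(mann_whitney_u(x, y) - mean_u)
    extreme = count = 0
    for idx in combinations(range(total_n), n):
        chosen = set(idx)
        group_x = [pooled[i] for i in idx]
        group_y = [pooled[i] for i in range(total_n) if i not in chosen]
        count += 1
        if abs(mann_whitney_u(group_x, group_y) - mean_u) >= observed - 1e-12:
            extreme += 1
    return extreme / count


# Hypothetical survival times in hours, invented for illustration only.
vaccinated = [60, 66, 72, 80, 90, 96]
sham = [48, 50, 52, 54, 56, 58]
u_stat = mann_whitney_u(vaccinated, sham)
p_value = exact_two_sided_p(vaccinated, sham)
print(u_stat, p_value, p_value < 0.05)
```

With two groups of six there are only 924 possible assignments, so full enumeration is cheap and avoids relying on a normal approximation at such small sample sizes.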
Table 1



4101.1

ATGGAAGAGTTAGTGACCTTAGATTGTTTGTTTATTGACAGAACT 
TTGTGTGGAAGAAAACGACAGAGAAATTCTCCGCCAAACTTCA

TCACTCCCCTTCTGATTAAATATGCCATGTTTGATAAGAAACAAAAGAG 

TAGCGAATTGGCACTATAATGACAAGGAGGATAGCTACACACATCCTGATGGCTGGTATTATCGTTTTCACCATAC 
CAAATATCAGAAAACACAGACAGACnTTCAACAAGAAATCAAGGTTTACTACGCCGACGAACCTGAATCAGCCCC 
TCAAAAGGGACTGTATATGAACGAACGCTATCAAAACTrGAAAGCTAAAGAATGTCAGG 
10 AGGTAGACAGATTTTCGCTCAACGCAAGATTGATGTGGAj\CCTGTCTTTGGGCAGATAAAGGCrrcr I" I GGGTTAC 

AAGAGATGTAATCTGAGAGGGAAGCGTCAAGTGAGAATTGACATGGGATTGGTACTTATGGCCAATAACCTCCTA 
AAATATAGTAAAATGAAATAA 

4101.3 

ATGGGGAAAGGCCATTGGAATCGGAAAAGAGTTTATAGCATTCGTAAGTTTGCTGTGGGAGCTTGCTCAGTAATG
ATTGGGACTTGTGCAGTTTTATTAGGAGGAAATATAGCTGGAGAATCTGTAGTrrATGCGG ATGA AACACTTATTA 
CTCATACTGCTGAGAAACCTAAAGAGGAAAAAj\TGATAGTAGAAGAAAAGGCTGATAAAGCTTTGGAj\ACT 
AATATAGTTGAAAGGACAGAACAAAGTGAACCTAGTTCAACTGAGGCTATTGCATCTGAGAAGAAAGAj\GATGAA 
GCCGTAACTCCAAAAGAGGAAAAAGTGTCTGCTAAACCGGAAGAAAAAGCTCCAAGGATAGAATCACAAGCT^ 

20 AAATCAAGAAjW\ACCGCTCAAGGAAGATGCTAAAGCTGT^ 

ggaaagtggattttaatcaaaattggtactttaaactcaatg 

acgtatctacgtggaaaaaattagatttaccgtatgactggagtatcttc 

acaaaatgaaggtggacagctcaacggtggggaagcttggtatcgca^ 

. c aag aaaaatgttcg ccttac1 tt 1 g atggcgtctacatgg attctcaagtttatgtcaatggtcagttagtgggg 
25 cattatccaaatggttata^ccagttctcatatgatatcaccaaataccttcaaaaagatggtcgtgagaatgtga 
ttgctgtccatgcagtcaacaaacagccaagtagccgttggtattcaggaagtggtatctatcgtgatgtgacttt 
acaagtgacagataaggtgcatgttgagaaaaatgggacaactattttaacaccaaaacttgaagaacaacaa 
tggcaaggttgaaactcatgtgaccagcaaaatcgtcaatacggacgacaaagaccatgaacttgtagccgaata 
tcaaatcgttgaacgaggtggtcatgctgtaacaggcttagttcgtacagcgagtcgtaccttaaaagcacatga 
30 atcaacaagcctagatgcgatttragaagttgaaagaccaaaactctggactgtttt^ 
tacgaattgattacgcgtgtttaccgtgacggtcaattggttgatgc 

ATCACTGGACTCCAAATGAAGGTTTCTCTTTGAj\TGGTG/^ 

CCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCGCCGTCTCAAACAAATGAAGGAGATGGGAG 
TTAACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAAACCTTGCAAATCGCAGCAGAACTAGGTTTACT 
35 TCAGGAAGAGGCCTTTGATACGTGGTATGGTGGCAAGAAACC^TATGACTATGGACGrriCrilGAAAAAGATGC 
CACTCACCCAGAAGCTCGAAAAGGTGAAAAATGGTCTGATTTTGACCTACGTACCATGGTCGAAAGAGGCAAAAA 
CAACCCTGCTATCTTCATGTGGTCAATTGGTAATGAAATAGGTGAAGCT^ 

GTTAAACGTTTGGTTAAGGTTATCAAGGATGTTGATAAGACTCGCTATGTTACCATGGGAGCAGATAAATTCCGTT 
TCGGTAATGGTAGCGGAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATTTAACTATTCTGAAGATA 

ATTACAAAGCCCTTAGAGCTAAGCATCCAAAATGGTTGATTTATG

TGGAAGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGATTA 
TGGAAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCTGGCTATGC 
TGGACAGTTTATCTGGACAGGTACGGACTATATTGGTGAACCTACACCATGGCACAACCAAAATCAAACTCCTGTT 
AAGAGCTCTrACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTATCTCTACCAAAGCCAATGGG 

45 TTTCTGTTAAGAAGAAACCGATGGTACACCI TCI 1 CCTCACTGGAACTGGGAAAACAAAGAATTAGCATCCAAAG 

TAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCTTATTCGAATGCTTCT^ 

ATCTCTTGGTCTTAAGACTTTCAATAAAAAACAAACCAGCGATGGGCGGACTTACCAAGAAGGTGCAAAT^ 
TGAACTTTATCTTGAATGGAAAGTTGCCTATCAACCAGGTACCTTC 

GAAATTGCTCGAGATAAGATTACGACTGCTGGTAAGCCAGCGGCAGTTCGTCTTATTAAGGAAGACCATGCGATT 
GCAGCAGATGGAAAAGACTTGACTTACATCTACTATGAAATTGTTGACAGCCAGGGGAATGTGGTTCCAACTGCT
AATAATCTGGTTCGCTTCCAATTGCATGGCCAAGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGT 
GAACGCTATAAGGCGCAAGCAGATGGTTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTGTCAAAT 
CAACTGAACAAGCAGGGAAATTCACCCTGACTGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACTGTCT^ 
TGGTAAGAAAGAAGGACAAGAGAAGACTGTTTTGGGGACAGAAGTGCCAAAAGTACAGACCATTATTGGAGAGG 
CACCTGAAATGCCTACCACTGTrCCGTTTGTATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAACCTGGTCTTC
AGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTAGAAGCTCGTGTAGA 
AGfGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCTCCAAATACTGACTTGAATTCTGTAGA 
TCTGTTTCCTATGTTTTGATTGATGGAAGTGTTGAAGAGTATGAAGTGGACAAGTGGGAGATTGCCGAAGAAGATA 
AAGCTAAGTTAGCAATTCCAGGTTO*CGTATTCAAGCGACCGGTTATTTAGAAGGTCAACCAATTCATGCAACCCT 
TGTGGTAGAAGAAGGCAATCCTGCGGCACCTGCAGTACCAACTGTAACGGTTGGTGGTGAGGCAGTAACAGGTCT
TACTAGTCAAAAACCAATGCAATACCGCACTCTTGCTTATGGAGCTAAGTTGCCAGAAGTCACAGCAAGTGCTAA 
AAATGCAGCTGTTACAGTTCTTCAAGCAAGCGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGA 
TGGTGGCCCTCTTCAAACCTATGCAATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACTTGAGCT 
GAAAAAGCTGACAGTCTCAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAA 
GCTGTATTACCAGCTGATAAAGTAACCTTCrCTACAAGTGGTGAAGGGGAAGTCGCAATTCGTAAAGGAATGCTT

GAGTTGCATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTGAACTCACTATC 

CAAGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATTTGCATCAGGAACCA 

AGTCTTCCAGCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTC 

CGAAj\GAAmAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGGAATTGA 

caaaagtctctgtagaaggtatcgtttcagttgaagaagtcagtgtgacaactccaatcgcagaagcaccacaat 

taccagaaagtgttcggacatatgattcaa^tggtcacgtttcatcagctaaggttgcatgggatgcgattcgtcc 

agagcaatacgctaaggaaggtgtctttacagttaatggtcgcttagaaggtacgcaattaacaactaaacttca 

tgttcgcgtatctgctcaaactgagcaaggtgcaaacatttctgaccaatggaccggttcagaattgccacttgcc 

tttgcitcagactcaaatccaagcgacccagtttcaaatgt^ 

ccaatcgttggacaaactggaatcgtactaatccagaagctrcagtcggtgttctgtttggagattcagct 

gagcaaacgctccgttgataatcraagtgtcggattccatgaagaccatggagttggtgtaccgaagtcttatgtg 

attgagtattatgttggtaagactgtcccaacagctcctaaaaaccctagttttgttggtaatgaggaccatgtct 

ttaatgattctgccaj^ctggaaaccagttactaatctaaaagcccctgctcaact 

actttagctttgataaagttgaaacctatgcrgttcgtattc 

tatcacagaggtacaaatctttgcgaaacaagttgcggcagccaagcaaggacaaacaagaatccaagttgacgg 
caaagacttagcaaacttcaaccctgatttgacagactactac 

acagcaagtgttagcaacaatggtctcgctaccgtcgttccaagcgttcgtgaaggtgagccagttcgtgtcatcg 

cgaaagctgaaaatggcgacatcttaggagaataccgtctgcacttcactaaggataagagcttactttct^ 

aaccagttgctgcggttaaacaagctcgcttgctacaagtaggtc 

ttacttcacaggtaaagacggctacgaa^caaaagacctgacagttgaatgggaagaagttccagcggaaaatct 
gacaaaagcaggtcaattttactgttcgaggccgtgtccttggtagtaacctrgttgctgagatcactgtacgagtg 
acagacaaacttggtgagactctttcagataaccctaactatgatc 

accaatgatatrgacaaaaactctcatgaccgcgttgactatctcaatgacggagatcattcagaaaatcgtcgtt 

ggacaaactggtcaccaacaccatcttctaatccagaagtatcagcgggtgtgattttccgtgaaaatggtaaga 

ttgtagaacggactgttacacaaggaaauagttcagttctttgcagatagtggtacggatgcaccatctaaactcgt 

tttagaacgctatgtcggtccagagtttgaagtgccaacctactattcaaactaccaagcctacgacgcagaccat 

ccattcaacaatccagaaaattgggaagctgttccttatcgtgcggataaagacattgcagctggtgatgaaatc 

aacgtaacatttaaagctatcaaagccaaagctatgagatggcgtatggagcgtaaagcagataagagcggtgtt 

gcgatgattgagatgaccttccttgcaccaj\gtgaattgcctcaagaaagcactcaatcaaagattcttgtagatg 

gaaaagaacttgctgatttcgctgaaaatcgtcaagactatcaaattacctataaaggtcaacggccaaaagtct 

cagttgaagaaaacaatcaagtagcttcaactgtggtagatagtggagaagatagctttccagtacitgttcgcct 

cgtttcagaaagtggaaaacaagtcaaggaataccgtatccacttgactaaggaa 

aaaccagtttctgagaagacagttgctgctgtacaagaagatcttccaaaaatcgaatttgttgaaaa 

gcatacaagacagttgagaaauaaagattcaacactgtatctaggtgaaactcgtgtagaacaagaaggaaaagtt 

ggaaaagaacgtatcnttacagcgattaatcctgatggaagtaaggaagaaaaactccgtgaagtggtagaagtt 

ccgacagaccgcatcgtcttggttggaaccaaaccagtagctcaagaagctaaaaaaccacaagtgtcagaaaaa 

gcagatacaaaaccaattgattcaagtgaagctagtcaaactaataaagcccagttaccaagtacaggtagtgcg 

gcaagccaagcagcagtagcagcaggtttaactcttctaggtttgagtgcaggattagtagttactaaaggtaaa 

aaagaagactag 



4101.5 

ATG GATG CAATCTTrGACCTAATCGGAAAGGTTTTCAATCCCATCTTAGAAATGGGTGGACCTGTCATCATG 
TCATTTTGACAGTATTGGCTTTACTTTTTGGAGTGAAAT^ 
CGCTCTTACAGGTATCGGTGCTATCATCGGTATGCTAAACACTGC^^ 
AACACTGGTATCCAATTGAGTATTACCGACGTTGGTTGGGCACCACTTGCT 

CACTAT ACTT CTTGCTCATCATGTTGATTGTCAACATAGTGATGCTAGCTATGAAGAAAACAGATACACTTGATGT 

CGATATCTTTGATA TCTG GCACTTGTCTATCACAGGTCTCTTGATTAAATGGTATGCTGATAACAATGGTGTGAGT 

CAAGGGGTTTCACTCTTTATTGCTACAGCAGCTATCGTCCHTGTCGGTGTGTTGAAAATTATCAACT 

GAAACCTACATTTGATGACCTTCTTAACGCCCCAAGTTCATCACCAATGACATCAACTCACATGAACTACATGATG 

AACCCAGTTATCATGGTTTTGGATAAGATTTTTGAAAAATTCTrCCCAGGCCITGA 

CTAAATTGAACAAGAAAATCGGTTTCTGGGGATCTAAATTCITCATC 

ATGGGAACTCCACATCCAAT TGCAG GTGTTGCAGATGCAGATAAATGGCGTCTTGTTATCAAAGGATGGTTGTCTC 

TTGGTTTGACTGCCGGTGTATCTTTGGAACrrcnTCrCACTTATCGGTTCATGGTTCATCGCAGCCGTAG^ 

TCACAAGGTATTACAAACGTTGCTACTAAACGTCTTCAAGGACGTAAATTCAATATCGGTCTTGACTGGCCATTCA 

TCGCTGGTCGTGCTGAAATCTGGGCTTGTGCCAACGTACTTGCACCAATCATGTTGATTGAAGCAGTGCl'rcriU' 

AAAAGTTGGAAATGGTATCTTGCCACTTGCAGGTATCATCGCTATGGGTGTTACTCCAGCTCTCTTGGTTGTAACT 

CGTGGTAAATTGCTCCGTATGATTATCITCGGAACACTCTTGTTGCCACrCTTCCr i ' C n TCAGGTACACTTATTGC 

ACCATTTGCAACAGAACTTGCTAAAGGTGTAGGTGCCTTCCCAGAAGGTGTGAGCCAAACTCAATTGATT 

TCTACTCTTGAAGGACCAATCGAAAAACTTCTTGGTTGGACAATTGGTAACACTACAACTGGTGATATCAA^ 

TCCTTGGTGCAGTAGTCnTCCTTGTATTCTATATCGGTATCTTTGCTTGGTACAGAAj^ 

CGAAGAGTACGCAGCAAAAGCAAAATAA 

4102.1 
ATGAAGATTATGAAAAAAAAATATTGGACTTTAGCGATATTAT I CTT1T GTTTGTTCAATAATTCTGTTACTGCTCA
AGAAATACCTAAAAATCTTGATGGCAATATAACTCACACTCAGACTAGCGAAAGTTITTCT 
ACAGGTTGACTATTCTAATAAAAATCAAGAAGAAGTAGACCAAAATAAATT^ 
ATTATTTGTAACAACAGATAAACATTTAGAAAAAAACTGTTGTAAATTGGAACTTGAA 
5 TATTGTTAmACTCTGAAAGTAATAATTTACTAGGCGAAGATAATTTAGATAATAAAATTAAGGAA^ 

CTAGATAATAGAGGAGGAj\ATATAGAGCATGACAAAGATAACTTAGAATCGTCGATrGTAAGAAAATATGAATGG 
GAT^TAGATAAAGTTACTGGTGGAGGCGAAAGTTATAAATTATATTCTAAAAG 
TTTTAGATTCAGGAGTCGATTTACAAAATACTGGATTACTGAAAAATCTTTCAAATCACTCA^ 
CAATAAAGGATATTTAGGAAAAGAGGAGGGAGAGGAAGGAATAATATCAGATATTCAAGATAGATTAGGTCATG 
GTACGGCTGTTGTAGCTCAAATTGTAGGGGATGACAATATTAATGGAGTAAATCCTCACGTTAATATTAACGTCTA
TAGAATATTTGGTAAGTCGTCAGCTAGTCCAGATTGGATTGTAAAAGCAATTTTTGATGCTGTAGATGATGGCAAT 
GATATTATCAATCTTAGTACTGGACAATATTTAATGATTGATGGAGAATA 

CAl 1 11 IGAAGTATAAAAAGGCTATTGATTACGCGAATCAAAAAGGAGTAATTATAGTAGCTGCATTAGGGAATG 

ACTCCCTAAATGTATCAAATCAGTCAGATTTATTGAAACTTATTAGTTCACGC^ 
15 AGTAGTTGATGTT CCAA GTTATTTCTCATCTACAATTTCGGTCGGAGGCATAGATCGCTTAG 

GTAATTTATCAGATITTA GCAAT AAAGGGGATTCTGATGCAATATATGCGCCTGCAGGCTCAACATTATCTCTTTC 

AGAATTAGGACTTAATAACTTTATTAATGCAGAAuAAATATAAAGAAGAT^ 

ATATACGTATCTTTATGGAAACTCATTTGCTGCrCCTAAAGTTTCTGGTGCGATTGCAATC 
2 ^ AATT AAAAG ATC AG C CCT AT AATT AT ATGTTTGT AAAAAAATTCTG G AA G AAAC ATT A CC AGT AA 

4106.1 

ATGAAGAAAACATGGAAAGTGTTTTTAACGCTTGTAACAGCTCTTGTAGCTGTTGTGCTTGTGGCCT 

GAACTGCTTCTAAAGACAACAAJ^GAGGCAGAACTTAAGAAGGTTGACTTTATCCTAGACT 

ACCACACAGGGCTTTATGTTGCCAAGGAAAAAGGTTATTTCAAAGAAGCTGGA 

25 CACCAGAAGAAAGTTCTTCTGACTTGGTTATCAACGGAA^GGCACCATITGCAGTGTATTTCCAAGACTACATGGC 
TAAGAAArTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGCTATrGTTGAACACAATACATCAGGAATCATCTC 
TCGTAAATCTGATAATGTAAGCAGTCCAAAAGACTTGGTTGGTAAGAAATATGGGACATGGAATGACCCAACTGA 
ACTTGCrATGTrGAAAACCTTGGTAGAATCTCAAGGTGGAGACrTTGAGAAGGTTGAAAAAGTA 
CTCAAACTCAATCACACCGATTGCCAATGGCGTCTTTGATACTGCTTGGATTTACTACGGTTGGGATGGTATCCTT 

30 GCTAAATCTCAAGGfGTAGATGCTAACTTCATGTACTTGAAAGACTATGTCAAGGAGTTrGACrACT^ 

TT ATC ATCGC AAAC AACG A CT ATCTG AAAG AT AAC AAAG AAG AAG CTCG CA AAGTC ATCC AAGC C ATC AAAAAAG 
GCTACCA ATAT GCCATGGAACATCCAGAAGAAGCTGCAGATATTCTCATCAAGAATGCACCTGAACTCAAGGAAA 
AACGTGACITTGTCATCGAATCTCAAAAATACTTGT^ 

TTGACGCAGCTCGCTGGAATGCTTTCTACAAATGGGATAAAGAAAATGGTATCCTTAAAGAAGACTTGACAG 
AAGGCTTCACCAACGAATTTGTGAAATAA

4106.4 

ATGAT AAAAA ATCCTAAATTATTAACCAAGTC 1 1TITIAAGAAGTTTTGCAATTCTAGGTGGTGTTGGTCTAGTCA 
TTCATATAGCTATTTATTTGACCnTTCCTTTTTATTATATTCAA 
40 AGTGTTTACGGAGTATTTAAAGACTAAGACATCTGATGAAATTCCAAGCTTACrCCAGTCTTATTCAAAGT 

ACCATATCTGCTCACCITAAAAGAGATATTGTAGATAAGCGGCTCCCTCTTGTGCATGACrrGGATATTAAAG 

GAAAGCTATCAAATTATATCGTGATGTTAGATATGTCTGTTAGTACAGCAGATGGTAAACAGGTAACCGTGCAATT 

TGTT CACGGGGTGGATGTCTA CAA AGAAG CAAAGAATATTTTGCTTTTGTATCT 

ATTGct l 1 1 iCCTTTGiri 1 Tl ITl AriU'rrATACrAAACGCrTGCTCAATCCr ClU ' I ' lUl 'ACATTTCAGAAGTGACT 
45 AGTAAAATGCAAGATTTGGATGACAATATTCGTrrTGATGAAAGTAGGAAAGATGAAGTTGGTGAAGTTG 
CAGATTAATGGTATGTATGAGCACITGTTGAj^GGTTATTTATGAGTrGGA 
TGCAAAATCAAAAGGTTTCCTTTGTCCGCGGAGCATCACATGAGTTGAA 

CCTAGAGAATATGCAGCATAATATTGGAGATTACAAAGATCATCCAAAATATATTGCAAAGAGTATAAATAAGAT 
TGACCAGATGAGCCAC TTATT AGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTGGACAGAGTGTCGTGAGAC 

CTTGACTGTTAAGCCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCTC

AAAATCAATTGACAGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTTTGACAAACCTGATTA 
GTAATGCAATTAAATATTCAGATAAAAATGGGCGTGTAATCATATCCGAGCAAGATGGCTATCTCTCTATCAAAA 
ATACATGTGCGCCTCTAAGTGACCA AGAA CTAGAACATTTATTTGATATATTCTATCATTCTCAAATCGTGACAGA 
TAAGGATGAAAGTTCCGGTTTGGGTCTrTACATTGTGAATAATATTTTAGAAAGCTATCAAATGGA 

CTCCCTTATGAACACGGTATGGAATTTAAGATTAGCTTGTAG

4106.6 

ATGTATTTAGGAGATTTGATGGAGAAAGCCGAGTGTGGTCAATTITCAATACIT^ 

AGACGACCG TCAA GGCTGTAATGGAAGAAACAGGATTTTCAAAAGCAACCCTAACCAAATATGTCACCCTGCTCA 
OK) ATGACAAGGCITTGGATAGTGGCITAGAGCTGGCTATTCACrCAGAAGATGAAAATCrGCGTCrGTCTATCGGTGC 
AGCTACCAAGG GGAGAG ATATTCGGAGCTTGTITTTGGAGAGTGCTGTTAAATACCAGATTTTGGT^ 
TACCACCAACAG'IU W 1 U'1AG CCCATCAGCTGGCTCAAGAATTGGTGATTAGCGAGGCTACGCTTGGTCGTCACTTGG 
CTGGTTTAAATCAGATTT TGTCAG AATTrGATTTATCCATCCAAAATGGCCGTTGGCGAGGTCCAGAGCATCAGAT 
TCACTATTrCTATTrCTGTCTTTTCCGAAAGGTCrGGTCGAGTCAGGAATGGGAAGGTCAC 
AGAAAACAGGAGATTGCCAATTTAGAGGAAATCTGCGGTGCAAGTTTGTCTGCGGGGCAGAAATTGGACTTGGTT
CTCTG GGCT CACATCAG TCAAC AACGTCTTCGGGTCAATGCTTGTCAGTTTCAAGTCATAGAAGAGAAAATGCGA
GGGTATTTTG ACAAT ATCTTTTATCTTCGTTTGCTG AG AAAGGTTCCGTC C 1 ' 1 TTTI GCTGGGCAACATATTCCACT 
AGGAGTTG AGG A TGGTG AG ATG ATGATATTCTTCTCT T 1' I CTCCTATCTCATCGCATTCTTCCTCTTCATACTATGG 
AGTATATTCTTGGTTTTGGAGGGCAGTTGGCAGATTTACTGACGCAATT 

TATTG GGGG ATTATACAGAGGACCATGTCACCTATGAACTCAGTCAGCTTTGTGCTCAAGTCTATCTCTATAAGGG 
CTATATTTTACAGGAT CGCTAC AAGTACCAGTTAGAGAATCGTCATCCATATTTACTGATGGAACATGATTTTAAA 
GAGACAGCAGAGGAGATTTTTCATGCTCTACCTGC1U4UCAACAGGGGACAGATTTAGATAAGAAGATTCTCTGGG 
AATGGCTCCAGTTAATCGAATATATGGCTGAAAACGGTGGCCAGCATATGCGGATTGGTCTGGATTTGACATCTGG 
TTTTCTTGTCTTTTCAAGGATGGCAGCCATTTTGAAACGGTATTTGGAA 

ATGACCCTAGTCGGCATTATGATTTGCTGGTTACCAATAACCCGATTCATAAGAAGGAACAGACACCAGTCTATTA 
TTTAAAAAATGACITGGATATGGAGGATTTGGTAGCGATTCGCCAGTTATTATTCACTTAA 

4106.7 

ATGGAATTTTCAAAGAAAACACGTGAATTGTCAATTAAAAAAATGCAGG 

GGTGG AGGA ATCACAGGAGCTGGTGTAGCCTTGCAGGCGGCAGCTAGCGGTCTTGAGACTGGTTTGATTGAAATG 
CAAGACTTTGCAGAAGGAACATCTAGTCGTTCAACAAAAT^ 

ACGTAGAAGTGGTCTCAGATACGGTTTCTGA^CGTGCAGTGGTTCAACAAATCGCTCCACACATTCCAAAATCAG 

ATCCAATGCTCTTACCAGTTTACGATGAAGATGGAGCAACCTTTAGCCTCTTCCGTCTTAAAGTAGCCATGGA 

GTACGACCTCTTGGCAGGTGTTAGCAACACACCAGCTGCGAACAAGGTTTTGAGCAAGGATCAAGTC^ 

CCAGCCAAACTTGAAGAAGGAAGGCTTGGTAGGAGGTGGAGTGTATCTTGACTTCCGTAACAACGATGCGCGTCT 

CGTG ATTGAAAACATCAAACGTGCCAACCAAGACGGTGCCCTCATTGCCAACCACGTGAAGGCAGAAGGCTTCCT 

CTTTGACGAAAGTGGCAAGATTACAGGTGTTGTAGCTCGTGATCTCTTGACAGACCAj\GTGTTTGAAATCAAGGCC 

CGTCTGGTTATTAATACAACAGGTCCTTGGAGTGATAAAGTACGTAATTTGTCTAATAAGGGAACGCAATTCT 

AAATGCGCCCAACTAAGGGAGTTCACTTGGTAGTAGATTCAAGCAAJVATCAAGGTTTCACAGCCAGTT^ 

ACACAGGTTTGGGTGACGGTCGTATGGTCTTTGTTCHrCCACGTGAAAAC 

AGACTACACAGGTGATTTGGAGCATCCAAAAGTAACTCAAGAAGATGTAGATTATCTACTTGGCATTGTCAACAA 
CCGCTTCCCAGAATCCAACATCACCATTGATGATATCGAAAGCAGCTGGGCAGGTCTTCG 

TCCATTGATTGCAGGGAACAGTGCCTCTGACTATAATGGTGGAAATAACGGTACCATCAGTGATGAAAGCTTTGA 
C AACTTG ATTGCG ACTGTTG AATCTTATCTCTCC AAAG AAAAAACACGTG AAG ATGTTG AGTCTG CTGTCAGCAAG 
CTTGAAAGTAGCACATCTGAGAAACATTTGGATCCATCTGCAGTTTCTCGTGGGTCTAGCTTGGACCGTGATGACA 
ATGGTCTCrTGACTCTTGCTGGTGGTAAAATCACAGACTACCGTAAGATGGCTGAAGGAGCTATGGAGCGCGTGG 
TTGACATCCTCAAAGCAGAATTTGACCGTAGCTTTAAATTGATCAATTCTAAAACTTACCCT 

ATTGAACCCAGCAAATGTGGAT TCAG AAATCGAAGCCTTTGCGCAACTTGGAGTATCACGTGGTTTGGATAGCAA 
GGAAGCTCACTATCTGGCAAATCTTTACGGTTCAAATGCACCGAAAGTCTTTGCACTTGCTCACAGCTTGGAACA 
GCGCCAGGACTCAGCTTGGCAGATACTTTGTCCCTTCACTATGCAATGCGCAATGAGTTGACTCTTAGCCCAGTTG 
ACTTCCTTCTTCGTCGTACCAATCACATGCTCTTTATGCGTGATAGC 

GAAATGGGACGATTCTATGACTGGACAGAAGAAGAAAAAGCAACTTACCGTGCTGATGTCGAAGCAGCTCTCGCT 
AACAACGATTTAGCAGAATTAAAAAATTAA 

4106.8 

ATGATGAATGAATTATTTGGAGAATTTCTAGGGACTTTAATCCTGATTCTTCT 

TGGTTC TTCCT AAAACCAAGAGCAATAGCTCAGGTTGGATTGTGATTACTATGGGTTGGGGGATTGCAGTTGCGGT 
TGCAGTCTTTGTATCTGGC AAGCT CAGTCCAGCTTATTTAAACCCAGCTGTGACCATCGGTGTGGCCTTAAAAGGT 
GGTTTGCCTTGGGCTTCCGTTTTGCCTTATATC^ 

GTTGCAATTCAAACCTCACTATGAGGCAGAAGAAAATGCAGGCAATATCCTGGCAACCTTCAGTACTGGACCAGC 

^^TlE^^^^ ^^^^T?^^^^^^^^*^^^* ^'^^G AAATCCTTGG AACTTXTGTTTTGGTGTTG ACAATCTTTG CTTTG G 

GTCTTTACGACTTTCAGGCAGGTATCGGAACCTTTGCAGTGGGAACrTTGATTGTCGGTATCGGTCTATCACTAGG 

TGGGACAACAGGTTATGCCTTGAACCCAGCTCGTGACCTTGGACCTCGTATCATGCACAGCATCTTGCCAATTCCA 

AACAAGGGAGACGGAGACTGGTCTTACGCTTGGATTCCTGTTGTAGGCCCTGTTATCGGAGCAGCCTTGGCAGTG 

CTTGTATTCTCACTTTTCTAG 

4106.10 

ATGAAAAG GACCT GGAGGAACTCATTCGTGACAAATCTTAATACACCTTTTATGATTGGCAATATTGAGATTCCCA 

ATCGTACCGTTTTAGCGCCTATGGCTGGCGTGACCAACTCAGCCTTTCGTACTATCGCAAAGGAGCTCGGAGCTGG 

ACTCGTTGTAATGGAAATGGTCTCTGACAAGGGAATCCAATACAACAACGAAAAAACCCTGCACATGCTTCATAT 

CGATGAGGGCGAAAACCCTGTCTCTATCCAACTTTTTGGTAGCGATGAAGACAGCCTAGCACGCGCAGCAGAATT 

CATCCAAGAAAACACCAAGACCGATATCGTCGATATCAACATGGGCTGCCCTGTCAACAAAATCGTGAAGAACGA 

AGCTGGTGCTATGTGGCTCAAGGATCCAGACAJ\GATITACTCCATCATCAACAAGGTCCAGTCTGTCCTTGATATC 

CCACTTACTGTCAAAATGCGTACCGGCTGGGCGGACCCATCTCTTGCAG 

CAGGTGTTTCTGCCCTCGCCATGCATGGCCGTACCCGTGAACAAATGTATACTGGCCACGCAGACCTTGAGACCCT 
TTACAAGGTTGCCCAAGCTCTAACCAAGATTCCATTCATCGCCAACGGTGATATCCGTACTGTCCAAGAAGCCAA 
GCAACGCATCGAAGAAGTTGGTGCTGACGCAGTCATGATTGGCCGAGCTGCCATGGGAAATCCI^ 
CCAAATCAACCATTACTTTGAA^CAGGAGAAATCCTACCTGAT^ 

CGAACACTTGAAACGATTGATTAACCTCAAAGGAGAAAACGTCGCAGTTCGTGAATTCCGCGGTCTCGCTCCTCA 






CTATCTCCGTGGAACATCTGGCGCTGCCAAACTCCGTGGAGCCATTTCGCAAGCCAGCACCCTGGCAGAGATTGA 
AACCCTCTTGCAATTGGAGAAGGCTTAA 



4107.1 

ATGACAAAGAAG AAAATTGAGCGTATTTCTGTAATACACCGAG AAAAG A I' 1 11 ATGGCTCAAGTGGTATTTCATGC 

GAGATAAAGAACAACCTAAGTATAGTGTCCITGAGCGTAAAATGTTTGATGCTGCTAAAAATCAAGATA 

CTTATCAAAAATACGCAACTATCAJ\GCAGATAACAGATATTAGGGTACAAACAAGTGAGGCTGACATTTTAGAGG 

CTGTAAAAGAGGTTTATGTGTACAuATCACATGAATGTTATCGGAGCTTGTCAGCGGATATTATTTATCAGTCAATC 

ACCAGCTTATGATAAGTTAAATAAGTGGTTTAATATCTATTCTGATTTGTATTTTAGCGT^ 

TGGGGGT AT ATC ATG AG ATGGT AG GT ATCT AG 

4107.2 

ATGAAAAATTCCAACGAGGCTGAGATGAAATTACTTTATACTGATATTCGGACTTCrrTGACA 
GAGAGGCAGAAGAGCTAGTTGCAGCTGGCAAGCGGGTCTTCTACATTGCCCCCAACTCTCn I CTTTTGAAAAGGA 
ACGCGCCGTGCTGG AATACTTGTCCCAGCAGGC I I CIV 1" 11 CGATTACCGTCACGCGCTTTGCTCAAATGGCTCGC 
TATCTGGTCTTGAATGATTTACCAGCTAAAACTACTCTTGATGATATCG 

TGCCGAACTCGATCCCAAGGACTrGCGTGTTTATGGCGCTATTAAGCAGGATCCTCAATTGATCCAGCAGTTAATT 
GAGCTTTACCATGAGATGACCAAATCTCAGATGAGTTTTTTGGACTTGGAGAATTTAACAGATGAGGATAAGAGG 
GCGGATTTACTCTTGATTTTTGAGAAAGTAACAGCCTATCrrAATCAAGGTCAGTTAGCCCAGGAAAGTCAGTTGT 
CCCATTTGATTGAGGCTATTGAGA^TGACAAGGtAAGTAGTGATTTTAATCAAATCGCCTTG 

TACTCG'lUUU'rCTGCTGAGGAj\GAGCGGGTTGTGGACTTACITCACGGCAAAGGTGTTGAGATTGTTATCGGGGCT 
TATGCTAGTAAGAAAGCCTATACCAGTCC1 T i l AGCGAGGGCAATCTCTACCAAGCCAGCGTAAAATTTCTCCATC 
ATCTGGCrrCTAAATACCAAACGCCT GCTCA GGACTGTTC TCAA ACTCATGAGAAGATGGATAGTTTTGACAAGGC 
CTCTCGTTTGTTGGAGTCITCTTATGACTTTTCAGAACTCGCTTTGGATGTCGATGAGAj\A 

CAAATCTGGTCTTGTTTGACGCAAAAGGAGGAGTTGGAGCTAGTAGCCCGTAGTATTCGTCAGAAATTACATGAG 
AACTCAGACCrGAGCrACAAGCATTTTCGTATTCTCTTGGGGGATGTAG Cl TCV1 ACCAGTTATCTCTCAAAACCA 
r 1 ' I TI G A CCAG TA TCAG ATTCCTTTn ATCTTGGTAGAAGCGAAGCCATGGCTCATCATCCCTTGACTCAGTTTGTC 
GAGTCTATTTTAGCrrTAAAACGTTACCGTTTTCGTCAGGAGGATTTGATTAATCTTCTTAGA 

TGACCTCAGTCAGTCTGATATTGATGCTTTTGAGCAj\TATATCCGCTATCTTGGTATCAjVTGGCTTGCCAGCCTTTC 
AGCAAACCTTCACCAAATCCCACCATGGAAAATTTAATCTrrGAGCGTTTGAATGTCCTCCGC 
ACCTCTTGAAACCCTCITTGCCAGCCGAAAACAAAAGGCTGA^ 
AGAAGGAGCTGTGACCAj\GCAGTTACAAGATITGACAACCACTTTGGAA^ 

AAGTTTGGAAGGCTTTCTGCCATGTITTAGAACAATTTGCGACTG 1TIV1GCTGGTTCACAGGTTAGTCTGGAAGA 
CTTCCTAGCCTTGCTCCATTCTGGAATGAGTTTGTCCCAATACCGTACCATTCCAGCAACAGTGGACACTGTTCTG 
GTGCAGAGTTACGATTTGATTGCACCATTGACTGCTGACTTTGTCT 

C AAAAATTTCTC AAA AC A C C AGTCTTCTG AC AG ATG AAG A AAGG C AAAACCT AAAC C AAGCG AC CG AAG AAG G C 
GTTCAATTACTGATTGCCAGCAGTGAAAATCTCAAGAAuAAATC^ 

GTAAGCAGTTGTTCTTGTCGGCTCCAAGCC I" 1 11 I AACGAAAGTG AAAGT AAGG AATCTGCCTATCTTC AAG AGTT 
GATCCATTTTGGATTTAGGCGGAGAGAGAAGAGGATGAATCACAAAGGACTGTCTAAGGAGGATATGGGGTCCTA 
TCACAGTCirTTGTCTAGTCTGGTTGCCTATCACCAGCAGGGTGAGATGAGCGATACTGAGCAAGATTTGAClU'1'1' 
GTCAAGGTTCTGTCGCGTGTCATAGGTAAAAAACTAGATCAGCAAGGTCTGGAAAATCCAGCTATCCCAACCAGT 
CCAAGCAGCAAGACCTTAGCCAAGGACACCTTGCAAGCTCT 

CT ATCCTGCCAAACAGGAGTTTTACCrGTCTACGTCGGGTTTGACAGAGTTTTATCGCAATGAATACAG 
CTACGCTACGTTTTAGGCITGCAGGAGGAATTACGTTTGCATCCrGATGCCCGTAGTCACGGGAATT^ 
GTATCTTTGAACGCGCCTTACAGTTGCCTAATGAAGATTCCrrrGACCAACGTCTAGAA 
CAGTCAAGAACGCGAATTTGAAGCTATTTATCAAGAAAGTTTGGAAGCCCAG 

GTTGCACGGACAACTGGACATATTCTCCGACACAATCCAGCCATCGAAACCATCAjVAGAAGAAGCAAATTTTGGT 
GGAAAAGACCAAGCCTTrArrCAATTAGACAATGGACGCAGTGTCTTTGTACGAGGCAAGGTGGACCGGAT^^ 
CGTTTGAAAGCTAATGGAGCGATAGGAGTAGTAGACTACAAATCCAGTCrGACTCAGTTCCAGTTTCCTCATTT 
TTAATGGGCTCAATTCrCAGTTACCAACCTATCTTGCrc^ 

CATGTACTTGGAAATGGCTGAj\CCTGTCCAATCrCTGATGGCGGTAAAAAGTCrGGCAGGAGCAGTGGTAGAAGC 

CAGCAAATCTATGAAATACCA^GGGCTCTTCTTGGAAAAAGAAAGCAGTTATTC 

CAAGGCTAATCAACTGACAGATGAGGAATTrCACCrCCTACTGGACT 

GAGAAGATTTTAGCAGGCCG GTTC GCCATCAATCCTTATACTGAAAATGGCAGAAGCATTGCCCCATACGTCCAG 
CAACATCAGGCTATTACAGGCTrTGAAGCCAATTACCATCTGGGCCAAGCCCGTTTCCTAGA 
CTGATGGCAAGCGTCTGGTCGGAGAAAAACTCAAGCAAGCTTGGCTTGAAAAAATAA 
GAGAGGAGTTGAATCGATGA 

4107.3 

ATGAAGCTTATTCCCTTTTTAAGTGAGGAGGAGATTCAAAAACT 

CAGAAGAAAACTGCCGAGCAAATCGAAGCTATCTACACTTCTGCCCAGAATATCCrGGTCTCAGCATCGGCTGGT 
TCTGGAAAGACCTTTGTCATGGCAGAGCGCATTCTGGACCAATTGGCGCGTGGTGTCGAA 
TCTCAACCTTTACCGTCAAGGCTGCAACTGAACHTA

AAACAGATGATGTCGACCTCAAACAACACnTGGGTCGCCAGTTGGCAGACCTACCCAACGCrGCCATTGGAACCA 

TGGATTCTTTCACACAAAAATTCCTTGGCAAACATGGTTATCTGCTTGATATTGCACCT 

AACCAAAGCGAGCAACTTATTCTCGAAAACGAAGTCITTC^ 

AAAGAGACCTTrAGTCATTTGCTGAAAAACTTTGCTGGGCGTGGCAAGGACGAACGGGGTCT 

TATAAAATCTATGACTTCCTCCAATCCACC\GTAATCCTCAAAAGTGGCTGAGTGAATCTTTC 

AGAAAGCTGATTTTACCAGTGAAAAAGAAAAACTGACCGAGCAAATCAAACA^ 

TTTTCCGTTACCATCTGGATAACGATGCCAAGGAGTTTGCAAAGGCTGCCTATTTAGAAAATC 

GGATGAAATTGGCTCCCTAAATCAGGAGTCCGATAGTCAGGCTTATCAGGCAGTGCTTGCGCGTGTTGTCGCCATC 

TCTAAGGAGAAAAACGGTCGA GCTC TGACTAATGCCAGCCGTAAGGCTGATTTGAAGCCCCTGGCTGATGCCTAC 

AACGAAGAGAGAAAGACCCAGTTTGCTAAACTAGGACAATTATCAGACCAGATAGCGAT 

TCTCGACTATCAAGAACGTTATCATGGAGACACTTGGAAACTAGCTAAAACC 

G TAGA GGCTTATCGTCAGAGAAAACGACAGGAAAATGCCTTCGAATTCGCTGATATCAGCCATTACACCATTGAG 

ATTTTAGAGAATTTCCCACAAGTTCGTGAGTCTTATCAGGAGCGCTTCCATGAAGTCATGGTCGATGAGTATCAGG 

ATACCAACCATATTCAAGAACGGATGCTGGAATTGTTGTCTAATGGCCACAATCGCTTTATGGTGGGAGATATCAA 

GCAATCCATCTATCGTTTCCGTCAGGCAGACCCGCAGATTTTCAATGAGAAATTCCAACGCTATGCGCAAAATC^ 

CAAGAAGGCAGGCTCATTATCCTCAAGGAAAATTTCCGTAGTAGTTCAGAAGTGCTGTCAGCAACCAATGATGTC 

TTTG AACGTCTCATGG ACCAAG AGGTCGGCG AAATC AACTATG ATAACAAGCACCAGCTTG IT1 ' I TGCCAATACCA 

AACTGACTCCCAATCCAGACAACAAGGCAGCATTTCTCCTCTACGACAAGGACGATACAGGTGAGGAAGAAGAGA 

GTCAAACAGAAACGAAACTAACAGGCGAAATGCGCTTAGTTATCAAGGAGATTCTGAAACTTCATCAAGAAAAAG 

GTGTTGCCTTTAAGGAAATTGCCCTTCTGACCTCCAGCCGCAGTCGTAATGACCAGATTCTCCTCGCCCTGTCTGA 

GTACGGAATTCCTGTCAAAACTGACGGAGAGCAAAACAATTATCTCCAATCCCTAGAACrrGCAAGTCATGCTAGA 

CACTCTTCGTGTCATTCACAATCCCCTGCAAGACTACGCCTTGG 

ATGAGGATGAGCTAGCACGTTTGTCCCTTCAGAAAGCAGAGGATAAAGTCCACGAAAATCTCTATGAGAAACTGG 
TC AATG C AC AAAAAATGG C AAGT AGTC AAAAA GG CTTG ATTC AC AC AG CTCT AG CTG 

AAAAACTAAAGCAATTCATG GATATC CTAGC1 IXU'rGGCGCTTGTATGCCAAAACCCACTCTCTCTATGACTTGAT 

TTGGAAGATTTACAACGACCGTTTTTATTATGACTATGTTGGGGCTTTGCCGAATGGTCCT 

CTCTATGCCCTAGCACTGCGTGCTGATCAATTTGAAAAGAGCAATTTCAAAGGTTTGTC^ 

TTGACCAAGTCTTAGAAGCCCAGCACGATTTGGCAAGCGTGGCCGTCGCACCGCCAAAAGATGCAGTAGAGCTCA 
TGACCATCCACAAGAGTAAAGGGCTGGAGTTTCCTTACGTCTTTATCCTCAmATATGGATCAAGATTTCAACAAGCA 

agactctatgtcagaagtcattctcagtcgtcagaatggtcttggtgtcaaatatattgccaagatggagacaggg 

gcagtagaagaccactatcctaaaaccatcaaactctccattcctagtctgacctataggcagaacgaagaggaa 

ttacagctagcaagctattctgagcagatgcgttrgctgtatgttgcratgacgcgggctgagaaaaagctct 

trgtcggcaagggttctcgtgaaaagctggaatccaaggaatacccagcagccaaaaatgggaaactaaatagca 

atactagactgcaagcacggaatttccaagattggctttgg 

actttagttatcgttttattggcgaagatcagttgaccagagaagctatcggagagttggaaaccaagagtcct 

ccaagatagctcccaagcagacaatcgtcagtcagataccatcaaagaagctctggaaatgctgaaggaggtgga 

agtttataatactcttcaccgcgcagctattgaacttcctagtgttcaaaccccaagtcaaatcaag 

gaaccagtratggatatggaaggtgtcgagattgctggtcaaggtcagtcagtaggcaagaaaatcagcttcgat 

ttgccagatttttcaaccaaagaaaaggtaacrggagctgagattggtagtgctactcacgaacrcatgcagaga 

attgacctcagccagcaactaacccttgctagcctaacagaaacactcaaacaagttcaaactagccaagct 

agagacaagatcaatcttgataaaattcttgcrrrcrrigacacagtactcggtcaggaaattct^ 

accatcittatcgcgagcaacctttcrccatgctcaaacgagaccaaaagagtcaggaa 

tatccttgatggctatctgcittacgaaaacaaaattgttctgttcgactacaagacagaccg 

agtcaactcgtagaccg ctatc gtggtcagttagctctatacgaagaggctttatcacgagcctattcgattgaaa 

atattgaaaaatacttgattttacrcggtaaagacgaggttcaagttgtaaaagtataa 

4109.1 

ATGGAACITGCTCGCCATGCrGAAAGCrTGGGAGTAGATC 

CAGAATACTCAGTTGCCAAATACTGGAACGATATCAGTTCTGCAGCTCCAAACACAGACTACGTGATTTACAACA 
TTCCTCAATTGGCAGGGGTTGCITTGACTCCAAGCCm 

GAAGAACTCTTCTATGCCAGTTCAAGATATCCAAACCrrrGTCAGCCTTGGTC 

GGTCCTGATGAGCAGTTCCTAGGAGGACGCCTCATGGGGGCTAGGGCTGGTATCGGTGGTACTTATGGTGCTATGC 
CAGAACTCTTCTTGAAACTCAATCAGTTGATTGCGGATAAGGACCTAGAAA 

CAACGCAATCATTGGTAAACTCACrTCTGCrCATGGA^ATATGTACGGTGTCATCAAAGAAGTCTTGAAAA 

GAAGGCTTGAATATTGGATCTGTTCGTTCACCATTGACACCAGTGACTGAAGAAGATCGTCCAGTTGTAGAAGCG 

GCTGCTGCCTTGATTCGTGAAACCAAGGAGCGCTTCCTCTAA 

4110.2 

ATGTATAAGACAAAGTGTTTACGAGAGAAGTTAGTATTATT 1 1 1 AAAAA 11 ITC1 1 CCCAATCCTG ATCTACCAAT 
TTGCCAATTATTCTGCCTCrrrTGTT GATACT GCAATGACAGGTCAA 

ATGGCAACCAGTATCTGG AATCCT T TCT 1 1 ACATTTCTAACAGGGATTGTGTCAGCCTTGGTGCCTATCATTGGTCA 

CCATCITGGTCGAGGCAAAAA GGAAG AAGTTGCGTCTGATTTTTACCAATTTATTTA 

GTGGTCTTGCTGGGGATGGTACTTTTC 

CAGTAGCGGTTCGCTATCTTTGGTTTTTATCrATCGGGATTATCCCCTTGTTGCTCTTTAGCGTCA 



TGGATTCGCTGGGCTTGACCAAACTGTCCATGTACCTCATGCTTTTGTTACTCCCTCTC 

CTCTTGATTTACGGTGCCTTTGGTGTTCCAGAACTGGGAGGGGCTGGTG 

GGGTCTTGCrTGGGATTTCTGTTCTGGTTTTATTTAAACAGGAGAAGCTCAAAG 

TCCACTTAATATGGATAAAATTAAGGAAGGAGTTCGTTTAGGTCTGCCTATTGGGGGAACTGTC^ 

GCT ATC1 TT rCAGTGGTTGGCTTG ATT ATGGCTAAGTTTTCGCCCTTG ATT AT AGCTAGTCACCAGTCAGCTATG AA 

CTTTTCAAGTCTTATGTACGCCTTTCCTATGAGTATCTCATCGGCTATG 

CCAAGCGATTTGATGATGCGAAAACCTATATTGGTCTAGGAAGATGGACTGCCCTCAIUUUIGCGGCCTTCACC^ 
AACCTTCCTTTACATTTTTAGGGGAAATGTGGCCAGTCTTTATGGTAACGACCCAAAATTTATCG 
CGTTTTTTAACTTATAGTCl'VriC 11 CCAGTTAGCAGATACCTTTGCGGCGCCGCTTCAGGGAATTTTGCGGGGGTA 
TAAGGATACAGTTATTCCTTTTTACCTTGGTTTGCTTGGTTATTGGGGCGTAGCAATCCCT 

4112.2 

ATGAGTACTTTAGCAAAAATAGAAGCGCTCTTGTTTGTAGCGGGTGAAGATGGGATTCGGGTCCGCCAGTTAGCT 
GAACTCCTCTCTCTGCCACCGACAGGCATCCAGCAAAGTTTAGGAAAATTAGCCCAGAAGTATGAAAAG GACC CA 
GATTCCAGTITGGCrTTGATTGAGACAAGTGGTGCrrATAGATTGGTGACCAAGCCrCAATT^ 
AGGAATACTCTAAGGCGCCTATCAACCAGAGCTTGTCTCGGGCrGC^ 

ACAGCCGATTACGCGGATAGAAATTGATGCCATCCGTGGAGTTAACTCGAGTGGAGCCTTGGCAAAGTTGCAGGC 
TTTTGACCTGATAAAGGAAGACGGGAAAAAGGAAGTATTGGGGCGCCCCAA^ 

CCTAGATTACATGGGGATAAACCATTTAGAAGAATTACCAGTGATTGATGAGCTTGAGATTCAAGCCCAAGAAAG 
CCAATTATTTGGTGAAAGGATAGAAGAAGATGAGAATCAATAA 

4113.1 

ATGGATACGATGATTAGTAGATTTITTCGCCATrrATTTGAAGCCTTAAAAAGT^ 

CAGTAGCTGCTGTCAGTTCAGTCATGATTACTTTGACCTTGGTGGCAATATTTGCATCTGTTATT^ 

AAACTAGCTACAGATATTGAAAATAATGTCCGTGTAGTAGTTTATATCCGAAAGGATGTGGAAGATAATAGTCAG 

ACAATTGAAAAAGAAGGTCAAACTGTTACAAATAATGACTACCACAAGGTATATGATTCTTTGAAGAACATGTCT 

ACGGTTAAAAGTGTTACCTTTTCAAGTAAAGAAGAACAATATGAAAAATTAACCGAGATAATGGGAGATAACTGG 

AAAATCTTTGAAGGAGATGCCAATCCTCTCTATGATGCCTATATTGTAGAGGCAAACACTCCAAJ\TGATGTAAAA 

ACTATAGCCGAAGATGCTAAAAAAATTGAAGGTGTCTCTGAGGTTCAAGATGGCGGTGCCAATACAGAAAGACT 

TTCAAGTTAGCTTCATTTATCCGTGTTTGGGGACTAGGGATTGCTG 

TTCAAATACCATTCGTATTACCATTATTTCCCGCAGTCGCGAA^ 

ACrrTATATCCGTGGACCGTrCTTGTTAGAAGGAGCCTrTATCGGTTTATTGGGAGCTATCGCACCAT 
CrrTATTGTTTATCAAATTGTTTACCAATCTGTCAACAAATCGTTGGTAGGG 
GATTTATTTAGTCCGTTGATGATTGCCCTACTATTTGTGATTGGGGrrTTCATTGGTTCATTGG 
CATGCGCCGATTCTTGAAGATTTAG 

4117.1 

ATGAAGAAAGTAAGATTTATTTTTITAGCTCrGCTATTTTTCT^ 

GTACTTGGCAAGGAAAACAGTATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGTTTrTGATACTCATT 

ATCAATCTTGGTTCTATATAAAAGCAGATGCTAACTATGCTGAAAATGAATGGCTAAAGCAAGGTGACGACTAT^ 

TTACCTCAAATCTGGTGGCTATATGGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCAA 

GATGGAAAGATGAAAAGAAATGCTTGGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGAC 

TGGGTCTATGATTCTCAATACGATGCTTGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTC 

CAAATTAAAGGGAAGGACTATTATTTCAAATCCGGTGGTTATCTACTGACAAGTC^ 

TGAATGCTAGTGGTGCCAAAGTACAGCAAGGTTGGCTTTTTGACAAACAATACCAATCTTGGTTITACATCAAAGA 
AAATGGAAACTATGCTGATAAAGAATGGATTTTCGAGAATGGTCACTA 

GCAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTGATGGGAAAATGGCTGAAAA 

TGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACrrCAAATCCGGTGGTTACATGACAGCCAATGAA 

GGGATAAGGAATCrTGGTTTTATCrCAAATCTGATGGGAAAATAGCTGAAAAAGAAT^ 

GTCAAGCrTGGTACTACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTT 
TTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAG 

AAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGATGGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGA 
GGAj\AAACTACAAATGAAAATGCTGCnTACTATCAAGTAGTGCCTGTTACAGCCAATG 

AAAAGCTTTCCTATATATCGCAAGGTAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGCGCTTGG 
CTATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGATrTACAAGCGCTAGATGCTAGTAAG 
CCCT TATTATGAG AGTGATGGCCACCG 1 1 1 1 1 ATCACTATGTGGCTCAGAATGCTAGTATCCCAGTAGCTTCTCAT 
CTTTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCCTGCATT^ 
CCITCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTC 

CATTAACAATAGCCrTTTGGAGAACAAGGGCGCTACTTTTAAGGAAGCCGAAGAACAT^ 

TATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAATTGCCAAAGATAAGAATAA'rri'Crri'G 

GCATTACAGCCTATGATACGACCCCnTACCnTTCTGCTAAGACATTrGATG 

AACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAGAACrTTCCnTGGAAACAAGGCT^ 

GGAATATGCTTCAGACCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGAAAATCAATGAGAAGCTAGGTGG 
4119.2

ATGAAAAAAGTATTACAAAAATATTGGGCATGGGCTTl 

ATGTGCCGATGTTTCAAGGAGCC1 rri'ACAGTTTTACCAACTGGACAGGATTGACTTATAACTACAAATTTGTTGG 
CTTAAACAACTTTAAGCrCCTCTTCATGGATCCAAAATTCATGAATGCGATTGGCTT^ 

GCCATGGTGGTTGGTGAGATTGCACTCGGGATCTTCATTGCGCGTGTCnTGAATTCTAj^AATCAAAGGCCAAACCT 
TCTTCCGTGCTTGGTTCTTCrTCCCAGCTGTTTTATCTGGTTTGACAGTGGCT^ 

TACGGTCTTCCAGCGATTGGAAATGCCCTTCATATTGAAVri'ri'CCAAACCAGTCrrr IAGGGACTAAGTGGGGAG 
CAATCTTTGCGGCTGTCTTTCrrCCTTCTTTGGCAAGGGGTGGCTATGCCCATCATCATCr^ 

TCTATTCCAACTGAGATTACAGAGGCAGCAAGGATTGATGGTGCGACTAGCAAGCAAGTTTTCTGGAACATTGAA 

TTGCCTTACTTGCTACCAAGTGTCTCTATGGTCTTTATCCTAGCCCTAAAAGGTGGGCTGACTGCCTTT GA 

CTTTGCCATGACCGGTGGTGGTCCAAACAATGCCACAACCTCACTTGGGCTCTTGGTTTATAACTATGCCTT^ 

AACAACCAATTCGGTTATGCCAATGCCATTGCCGTAATCTTGTTCTTCTTAATTGTAGTGATTTCGATCATCC 

GAGAGTATCTAAGAAATTTGAAATTTAA 

4119.3 

ATGATGAAACAAGATGAA AGAA AAGCCCTGATTGGCAAATACATrCTATTGATTCT^ 

TGCCGCTCCTTGCTACCCTCTTTAGTTCCTTC AAACCC ACT AAGG AT ATTGTAG AT AA" l'l" rCTTI GGCTTTCCAACC 
AACTTCACATGGGACAACrrTAGCCGTCTCHTAGCTGATGGGATTGGA 

CrGTCrrGTCTTTACTTGCAGTAATGATCTTTATCCCTATGGCAGCCTACTCCATCGCTCGCAATA 

AAAAGCCnTTACCATCATGTATACCCTCTTAATCCTCGGAATCTTCGTACCTTTCCAAGTCATCAT^ 

CGGTTATGATGAGTAAACTCGGTTrGGCTAATACCTTTGGTrrGATCTTGCTCTACTTGACCT 

ACCCTCTTTCTCTATGTTGGCTATATCAAAATCTCGATTCCAGAAAGTCTGGATGAAGCAGCAGAGATCGATGGGG 
CTAATCAATTTACAACCTATTTCCGCATCATCTTCCCAATGATGAAACCGATGCATGCGACAACCATGATCATCAA 
TGCCCTITTGGTTCTGGAATGACTrCATGTTGCCACTCCTTGTCrrc 
TGTTCCAATACAACTACGCAGGCCAATATTTCAACGACTACGGACCAj^GCTTTGCCT 

CAGTATCACCATTGTCTATCrCTrCrrCCAj\CGCCATATCATTTCAGGAATGAGCAACGGGGCAGTGAAGTAA 
4119.4 

ATGAAAAGTATTCTTCAGAAAATGGGGGAGCATCCGATGCTG Cnc rrr 1 1 C 1 1 AG CT AT AGTACTGTT AT ATCC A 
TTCTTGCACAAAATTGG ATGGGTCTTGTGGCTTCAGTAGG AATGTTTCTATTTACTA l'l " rTCTTTTI GCACTATCAG 
TCGATTTTATCCCATAAATTCTTTCGATTGATTTTGCAGTTnrGTCTTGTTTGGTAGTGT 
AGTTTAGAACATTTCCAAA TTGTG AAGAAATTTAACTATGCnTrC 

ACCGGGCAG AAGTG ACC1" VCTT I'AATCCTAATTATTATGGAATTATTTGTTGTTTCTGTATTATGATTGCTTTCTAT 

CTGTTTACAACGACCAAGTTGA^TTGGTTGAAAGTATTCTGTGTGATTGCAGG 

CTTTACTCAAAATCGAACTGCCTTrCCTGCTATTATCGCTGGAGCAATTATCTATCT 

GG AAGGC C 1 ' lU ' l GGCTrAGTATTGGGGTCTTCGCGATTGGTTTGAGTTTCCT C 1 T i ' l 'CTAGTG ATTTGGG AGTTCG A 
ATGGGTACTITAGACTCrrCTATGGAAGAACGCATTTCTATCTC^ 

CT l"l" r 1 GGGGTGAAGGGCCATTGACCTATATGAACTCTTATCCTCGGATACATGCTCCTTATCATGAACATGCCCA 
CAGTCrTTATATTGATACGATTCTGAGTTACGGAATTGTGGGGACTATTTTATTAGTTTT 

TTCGCTTGATGATGGATATGAGTCAGGAGTCGGGGAAACGTCCGATTATCGGCCTTTATCTATCTTTCCrTACAG 

GGTTGCTGTGCACGGAATrTTTGACTTGGCTCTCrTCT^ 

GTATTCCATTGGAGCATCGAATGTTGGTATCGGACATGACGGATTAA 

4120.1

ATCTCAAAGATGGATGTT CAGA AAATCATTGCACCGATGATGAAGTTTGTGAATAT^ 

AAGATGGGATGTTAGCAATTTrGCCATTGACAGTAGTTGGTAGTTTGTTCTTGATTATGGGACAATTGCCGT^ 
AGGATTAAATAAGAGCATTGCTAGTGTTTTTGGAGCTAATTGGACAGAGCCGtTTATGCAAGTATAT^ 
TTTGCT ATT ATGGGTCTAATTTCTTGTTTTTCAATTGCCT ATT CITATO 
AGCTGGAGTTCTATCTGTATCTGCATTCTTTATTrTGCTAAGATCATCT 

GGGACGCTATTAGTAAAGTTTGGTTTGGAGGCCAAGGAATTATCGGTGCTATCATTATAGGTTTGGTAGTAGGAAG 
TATTTATACCncrT'rATAAAGAGAAAAATTGTTATTAAGATGCCAGAACAAGTTCCACAAGCTATTGCCAAACAG 
TTTGAAGCAATGATTCCAGCATTTGTAATTTTCITATCTTCTATGATTGTATATA 
TGGCGGAACATTCATAGA^TGATTTATTCTGCTATTCAAGTTCCGTTGCAAGGTTTAACT 

GCTATTGGAATTGCATTCTTTATATCAr'T^UUGTGGTGGTTTGGTGTTCATGGGCAATCGGTAGTAAATGGAGTAGT 

GACAGCTCTGCTTTTATCTAATCTITGATGCTAATAAAGCTATGTTAGCCTCTGCT 

GCACATATTGTTACrCAACAATTTTTAGATTCATTTTTA^ 

TGCCATGCTTTTTGCAGCAAAATCAAAACAATACCAAGCCTTAGGAAAAGTTGCA 

GTAAATGAGCCAGTTGTATTTGGATTTCCGATTGTCATGAATCCAGTTATGTTTGTACCrrTCATrCT^ 

ACTTGCAGCTGTG ATAG TATATGGAGCTATTGCAACAGGTTTCATGCAGCCATTCTCAGGGGTAACATTGCCTTGG 

AGTACACCAGCTATTTTATCAGGATTrTTGGTGGGTGGATGGCAAGGAGTTATTACTCAGCTGGTGATATTAGCGA 

TGTCTACATTGGTTTATTTTCCATrCTTTAAAGTACAGGATCGT^ 

4121.2

ATGAAGAAAAAGGACTTAGTAGACCAACTAGTCTCAGAGATCGAGACGGGGAAAGTCAGGACACTGGGAATATA 
CGGTCATGGAGCTTCAGGTAAATCAACCTTTGCACAGGAATTGTACCAAGCrrTAGATTCTACT 
CTAGAGACAGATCCTTATATCACCTCAGGACGCCATCTGGTAGTACCCAAGGACGCGCCGAATCAAAAGGTGACA
GCCAGTCTGCCAGTGGCGCATGAACTGGAGAGTTTGCAGAGAGATATCCTTgCTTGCAGGCGGGTATGGATGTCTT 
GA 

4122.1 

atgaagaaaagatacctagtcttgacagctttgctagccttgagtctagcagcttgttcac 

aatgaagatggagaaactaagacagaacagacagccaaagctgatggaacagtcggtagtaagtctcaaggagc 

tgcccagaagaaagcagaagtggtcaataaaggtgattactacagcattcaagggaaatacgatgaaatcatcgt 

agccaacaaacactatccattgtctajvagactataj\tccaggggaaaatccaacagccaaggcagagttggtcaa
actcatcaaagcgatgcaagaggcaggtttccctattagtgatcattacagtggttttagaagttatga^ 
accaagctctatcaagattatgtcaaccaagatggaaaggcagcagctgaccgttactctgcccgtcctggctat 
agcgaacaccagacaggcttggccmgatgtgattgggactgatggtgatttggtgacagaagaaaaagcagcc 
caatggctcttggatcatgcagctgattatggctttgttgtccgttatctcaaaggcaaggaaaaggaaacaggct 

atatggctgaagaatggcacctgcgttatgtaggaaaagaagctaaagaaattgctgcaagtggtctcagtttgg
aagaatactatggctttgaaggcggagactacgtcgattaa 

4125.6 

ATGCGTAAATTCTTAATTATTTTGTTGCTACCAAGTTTTTTGACCATT^ 
TCGTCTATACTTCGAAAGAAATTTATTACCTTTCACAAT

CCCATGGTTTATGGAGAGGTTCCTGTTTATGCGAATGAAGATrTAOT 
CAAG i 1 1 1 CAAATAACCGAGTGGCGCITAAATAAACAAGGAATTCCA 

AGCTG CG G AC AAAC G ATTTTT AT ATG ATC AAT C AG AGGT AACTC C AA C AAT AAAAAAAGT ATG GTT AG AATCTG A 
CTTTAAACTGTACAATAGTCCTTATGATTTAAAAGAAGTGAAATCATCCTTATCAG 

GACAAGACCATGTTTGTAGAAGGAAGAGAATTTCTACATATTGATCAGGCTGGATGGGTAGCTAAAGAATCAACT
TCTGAAGAAGATAATCGGATGAGTAAAGTTCAAGAAATGTTATCTGAAAAATATCAGAAAGATTCTrrCT 
ATGTTAAGCAACTGACTACTGGAAAAGAAGCTGGTATCAATCAAGATGAAAAGATGTATGCAGCCAGCGTTTTGA 
AACTCTCTTATCTCT ATTAT ACGCAAGAAAAAATAAATGAGGGTCTTTATCAGTTAGATACGACTGTAAAATA 
ATCTGCAG TCAA TGATTTTCCAGGTTCTTATAAACCAGAGGGAAGTGGTAGTCTTCCrAAAAAAGAAGATAATAA 

AGAATATTCTTTAAAGGATTTAATTACGAAAGTATCAAAA

TACATTTCAAACCAATCTGATGCCACATTCAAATCCAAGATGTCTGCCATTATGGGAGATGATTGGGATCCAAAAG 
AAAAATTGATTTCTT CTAA GATGGCCGGGAAGTTTATGGAAGCTATTTATAATCAAAATGG 
TTTGACTAAAACAGATTTTGATAGTCAGCGAATTGCCAAAGGTGTTTCTGTTAAAGTAG 
CTCAT AAAATTGGAGATGCGGATGAATn-AAGCATGATACGGGTGTTGT 

ATITrCACTAAGAATTCrGATTATGATACGATTTCTAAGATAGCCAAGGATGTTrA

4125.7 

ATGAAAAAACAAAATAATGGTTTAATTA^AAAATCCTTTTCTATGGTTATTATTTAT CIU 

CCAGTATTTCTATTCTGGGAATAACTCAGGAGGAAGTCAGCAAATCAACTATACTGAGTTGGTACAAGAAATTAC 
CGATGGTAATGTAAAAGAATTAACTTACCAACCAAATGGTAGTGTTATCGAAGTTTCTGGTGTCTATAAAAATCCT
AAAACAAGTAAAGAj\GAAACAGGTATTCAGTTTTTCACGCCATCTGTTACTAAGGTAGAGAAATTT^ 
ATTCnTCCTGCAGATACTACCGTATCAGAATTGCAAAAACTTGCTACTGACCATAAAGCAGAAGTAACTGTTAAGC 
ATG AAAGTTCAAGTGGTATATGGATTAATCTACTCGT ATCCATTGTGCCATTTGG AATTCTATTC 1 1 C 1 1 CCTATTC 
TCTATGATGGGAAATATGGGAGGA GGCA ATGGCCGTAATCCAATGAGTTTTGGACGTAGTAAGGCTAAAGCAGCA 
AAT AAAGAAG AT ATT AAAGTAAGATTTTC A GATGTTGCTGG AG CT^

GTrGAGTTCTTAAAAGATCCAAAACGATTCACAAAACTTGGAGCCCGTATTCCAGCAGGTGTT Cl 11 1 GGAGGGAC 

CTCCGG GGAC AGGTAAAACTTTGCTTGCTAAGGCAGTCGCTGGAGAAGCAGGTGTrCCATTCTTTAGTATCTC 

TTCTGACTTTGTAGAAATGTTTGTCGGAGTrGGAGCTAGTCGTGTTCGCTCTCT 

GCACCAGCTATCATCTTTATCGATGAAATTGATGCTGTTGGACGTCAACGTGGAGTCGGTCTCGGCGGAGGTAATG 
ACGAACGTGAACAAACCTTGAACCAACTTTTGATTGAGATGGATGGTTTTGAGGGAAATGAAGGGATT

TCGCTGCGACAAACCGTTCAGATGTACTTGACCCTGCCCTTTTGCGTCCAGGACGTTTTGATAGAAAAGTATTGGT 
TGGTCGTCCTGATGTTAAAGGTCGTGAAGCAATCTTGAAAGTTCACGCTAAGAATAAGC 
CTTTAGCAGAAGAT GTTG ATTTGAAATTACnrGGCTCAACAAACTCCAGGCTrTTGTTGGTGCT 
CTTGAATGAAGCAGCITTAGTTGCTGCTCGTCGCAATAAATCGATAATTGATGCTTCAGATATTGA 
GATAGAGTTATTGCTGGACCTTCTAAGAAAGATAAGACAGTTTCACAAAAAGAACGAGAATTGGTTGCTTACCAT
GAGGCAGGACATACCATTGTTGGTCTAGTCTTGTCGAATGCTCGCGTTGTCCATAAGGTTACAATTGTACCACGCG 
GCCGTGCAGGCGGATACATGATTGCACTTCCTAAAGAGGATCAAATGCTTCTATCTAAAGAAGATATGAAAGAGC 
A ATTG GCTGGCTTAATGGGTGGACGTGTAGCTGAAGAAATTATCTTTAATGTCCAAACCACAGGAGCTTCAAACG 
ACTTTGAACAAGCGACACAAATGGCACGTGCAATGGTTACAGAGTACGGTATGAGTGAAAAACrTGGCCCAGTAC 
AATATGAAGGAAACCATGCTATGCITGGTGCACAGAGTCCTCAAAAATCAATrrCAGAACAAACAGCTTATGAAA
TTGATGAAGAGGTTCGTTCATT ATT AAATGAGGCACGAAATAAAGCTGCTGAAATTATTCAGTCAAATCGTG AAAC 
TCACAAGTTAATTGCAGAAGCATTATTGAAATACGAAACATTGGATAGTACACAAATTAAAGCTCrTTACGAAAC 
AGGAAAGATGCCTGAAGCAGTAGAAGAGGAATCTCATGCACTATCCTATGATGAAGTAAAGTCAAAAATGAATGA 
CGAAAAATAA 
4125.10

ATG AGGG AACCAG A ITTTT i AAATCATTTTCTCAAG AAGGG ATATTTCAAA AAGCATGCTAAGGCGGTTCTAGCTC 

TTTCTGGTGGATTAGATTCCATGTITCrATTTAAGGTATTGTCTACTTATCAAAAAGAGTTAG 

CTAGCTCATGTGAATCATAAGCAGAGAATTGAATCAGATTGGGAAGAAAAGGAj\TTAAGGAAGTTGGCTGCT 

GCAGAGCTTCCTATTTATATCAGCAATTTTTCAGGAGAATTTTCAGAAGCGCGTGCACGAAATTTTC 

TTTTI CAAG AGGTCATG AAAAAG ACAGGTGCG ACAGCTTTAGTCACTG CCCACCATGCTG ATG ATCAGGTGG AAA 

CG A' I U 'I " 14' ATGCGCTTG ATTCG AGG AACTCGCTTGCGCTATCTATC AGG AATT AAGG AG AAGC AAGT AGTCGG AG A 

G AT AG AAATCATTCGTCC C ITC n GC ATTTTC AG A AAAAAG ACTTTCC ATCAA 1 ' 1 ' IT L 'C ACTTTG AAG AT AC ATCA 

AATCAGGAGAATCATTATTrrCGAAATCGTATTCGAAATTCTTACTTACCAGAATTGGAAAAAGAAAATCCT 

TTAGGGATGCAATCTTAGGCATrGGCAATGAAATTTTAGATTATGATTTGGCAATAGCTGAATTATCTAA 

TAATGTGGAAGATTTACAGCAGTTA'l TTTCT1 ACTCTG AGTCTACACAAAG AGTTTTACTTCAAACTTATCTGAATC 

GTTTTCCAGATTTGAATCTTACAAAAGCTCAGTTTGCTGAAGTTCAGCAGATr^ 

TC ATCCG ATT AAAAATGG CT ATG AATTG ATA AAAG AGT A CC AA C A GTTTC AG ATTTGT AAAATC AGTCCG C AG G CT 
GATGAAAAGGAAGATGAACTTGTGTTACACTATCAAAATCAGGTAGCTTATCAAGGATATTTArri I L I 1 1 IGGAC 
TTCCATTAGAAGGTGAATTAATTCAACAAATACCTGTTTCACGTGAAACATCCATACACATTCGTCATCGAAAAAC 

AGGAGATGrrTTGATTAAAAATGGGCATAGAAAAAAACTCAGACGTTTATTTATTG 

ATTTGAAAATCCCTATGGAAAAGAGAAACTCTGCTCTTATTATTGAGCAATTTGGTGAAATTGT 

AATTGCGACCAATAATTTGAGTAAAAAAACGAAAAATGATATAATGAACACTGTACTTTATATAGAAAAAATAGA 

TAGGTAA 

4126.1 

ATG AAGCGTTCTTCTC IT I " l' AGTTAG AATGGTT ATTTCC ATCTTTCTGGTCTTTCTC ATTCTCCTAGCTCTGGTTGG A 

ACTTTCTACrATCAATCAAGTTCTTCAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGCCAG 

ACTAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCACCT^ 

TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTTTGACCATCTTGAAGTCAGA 

TAAGGACITGAAAACTGTTGTGCTGGTGACCAAj\TCTGGTCAGGTCATTTCTACAGATGACAGTGTGCAGATGAA 

AACITCCrCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCATCAGGGAGCTATGCCTGTTTTGACrCCA 

GCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAACTTGTTGATGCAAAGGGAGCCAAT CTTGG TG 

TGCTTCGTTTGGATATTTCTTATGAAACTCTGGAAGCCTATCTCAATCAACTCCAGTTGGGGCAGCAGGGCT^ 

CTTCATTATCAATGAAAACCATGAATTTGTCTACCATCCTCAACACACAGTTTATAGTTCGTCTAGCAAAATG 

GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 

GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATrGGAAAAGTTAGACCAGGTTCGGAGTCAGCTCTTGTGG 

ACCITGCTTGGGGCCAGTGTCACATCTCTTCTTGTCTGTCTCTGCTTAGTGTGGTTCAGTC 

TCCTTTGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCTTCGTGCCAAGGA 

GCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATCAGATTGATCAGTTGATGGTAGCTATTC 

GTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCTTTCGAGCCAGATTAATCCACATTTCCTCTATAA 

CACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAGTCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCA 

ACCTATTTCCGCTTGGCGCTCAATCAAGGCAAGGACTTGATTTGTCTCTCTGACGAAATCAATCATGTCCGCCAGT 

ATCTCTTTATCCAGAAACAACGCTATGGAGATAAGCTGGAATACGAAATTAATGAAAATGTT 

AGTCTTACCCAAGCTGGTCCTACAACCCCTTGTAGAAAATGCTCTTTACCATGGCATTAAGGAAAAGGAAGGTCA 

GGGCCATATTAAACTTTCTGTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTC 

CAAGATGCTGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTC 

AAACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAGTTGAAATATAT 

AT AAAT AG AATAG AAACT AG CT AA 



4126.7 

ATG AAG CGTTCHTCTCTTTT AGTTAG AATGGTTAT 

ACITTCTACTATCAATCAAGTTCTTCAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGCCAG 

ACTAGCCACTTTATTCAGTCTTATATCAAA AAACT AGAAACCACCTCGACTGGTTTGACCCAGCAGACGGATGTTC 

TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTTTGACCATCTTGAAGTCAGA 

TAAGGACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATTTCTACAGATGACAGTGTGCAGATG^ 

AACTTCCTCTGATATGATGG CTGAGGATTG GTACC AAA AGGCCATTCATC AGGG AGCTATGCCTGTTT^ 

GCrCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAACTTGTTGATGCAAAGGGAGCCAATCTTGGTG 

TGCnTCGTTTGGATATTTCITATGAAACTCTGGAAGCCTATCT 

CTTCATTATCAATGAAAACCATGAATTTGTCrACCATCCrrCAACACACAGTTTATAGTTCGTCTAG 
GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 
GCAGGAACTGATTGGACGGTGCITGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCrCTTGTGG 
AC CITG CrTGGGGCCAGTGTCACATCrCITCTTGTCT^ 

TCCnTTGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCTTCGTGCC 

GCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATCAGATTGATCAGTTGATGGTAGCTATTC 

GTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCnTTCGAGCCAGATTAATCCACATTTCCTCTATAA 

CACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAGTCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCA 

ACCT ATTT CCGCTTGGCGCTCAATCAAGGCAAGGACTTGATTTGTCTCTCTGACGAAATC 

ATCTCTTTATCCAGAAACAACGCTATGGAGATAAGCTGGAATACGAAATTAATGAAAATG 
AGTCnTACCCAAGCTGGTCCTACAACCCCnTGTAGAAAAT

GGGCCATATTAAACrTTCT.GTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTC 
CAAGATGC TGGT GATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTC 
AAACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAGTTGAAATATA 
ATAAATAGAATAGAAACTAGCTAA 

4127.4 

ATGTTTTT TA AATTATTAAGAGAAGCTCTTAAAGTCAAG 

TTTGGTCTTTCGTATCGGAACTAGCATTACAGTTCCTGGTGTGAATGCCAATAGCTTGAATGCTTTAAGTC 
CCTTCTTAAACATGTTG AGCTTGGTGTCGGGG AATGCCCT AAAAAACTTTTCG A 1 ' ITT I 'GCCCTAGG AGTT AGTCC 
CTATATCACCGCTTCTATTGTTGTCCAACrcrTGCAAATGGATA 

GGGGAAGTAGGT CGAAG AAAATTGAATCAAGCTACTCGTTATATTGCTCTAGTTCTCGCrri 1 GTGCAATCTATCG 

GGATTACAGCTGGTrrTAATACCTTGGCTGGAGCTCAj\TTGATTAAAACTGCTTTAACTCCACA^ 

GATTGGTATCATCTTAACAG CTGG TAGTATGATTGTCACTTGGTTGGGTGAGCAAATTACAGATAAGGGATACGGA 

A ACGG TGTTTCCATGATTATCnTTGCCGGGATTGTrrCCTCAATTCCAGAGATGATTCAGGGCATCTATGT 

ACTTTGTG AACGTCCCAAGTAGCCGTATCACTTC ATCTATCA l'l T I CGT AATCATTTTG ATT ATTACTGTATTGTTG 

ATTATrrACTTTACAACrrATGTTCAACAAGCAGAATACAAAATTCCAATCCAATATACT 

CTCCATCTAGCTCTTACCTTCCGTTAAAAGTAAACCCTGCTGGAGTTATCCCT 

GCAGCGCCTGCGGCTATTCTTCAG I 1 1 T 1 GAGTGCCACAGGTCATGATTGGGCTTGGGTAAGGGTAGCACAAGAGA 
TGTTGGCAACTACTTCrCCAACTGGTATTGCCATGTATGCTTTGTTGATTATTCTCT^ 

T AC A G ATT AATCCTG AA AAAGC A GC AG AG AC CT AC AAAAG AGTG GTG CCT AT ATCC ATGG AGTTCGTCCTG GT A A 
AGGT ACAG AAG AAT ATATGTCT AA AC l'l CI "I CGTCGTCTTGCAACTGTTGGTTCCCTCTTCCTTGGTGTG A 



4127.5 

ATGGATATTAGACAAGTTACTGAAACCATCGCCATGATTGAGGAGCAAjAACTTCGATATTAGAACCATTACCATG 

GGGATTTCTCrriTGGACTGTATCGATCCAGATATCAATCGTGCTGCGGAGAAAATCrATCAAAA 

AGGCGGCTAATTTAGTAGCTGTTGGTGATGAAATTGCGGCTGAGTTGGGAATTCCTATCGTTAATAAGCGTGTATC 

GGTGACACCTATTTCTCTGATTGGGGCAGCGACAGATGCGACGGACrACGTGGTTCT 

GGCTGCGAAAGAGATTGGTGTGGACnTATTGGTGGTTTTTCTGCC^ 

GAGATTCTCATCAATTCCATTCCTCGCGCITTGGCTGAGACGGATAAGGTCTGCTCGTCAGTCAATATCGGCT 

CCAAGTCTGCTATTAj\TATGACGGCrGTGGCAGATATGGGACGAATTATCAAGGAAACAGCAAATCI^ 

TGGGAGTGGCCAAGTTGGTTGTATTCGCTAATGCTGTTGAGGACAj\TCCATTTATGGCGGGTGCCITTCATGGTGT 

TGGGGAAGCAGATGTTATCATCAATGTCGGAGTITCTGGTCCTGGTGTTGTGAAAC^ 

GGACAGAGCTTTGATGTAGTAGCCGAAACAGTTAAGAAAACTGCCTTTAAAA 

GTCAAATGGCCAGTGAGAGACTGGGTGTGGAGTTTGGTATTGTGGACITGAGTTTGGCACCAACCCCTGCGGTTGG 
AGACTCTGTGGCACGTGTCCirGAGGAAATGGGGCTAGAAACAGTTGGCACGCATGGAACGACGGCTGCCTTGGC 
CCTCTTGAACGACCAJVGTTAAAAAGGGTGGAGTGATGGCCTGCAACCAAGTOT 
CCrGTTTCTGAGGATGAAGGAATGATTGCTGCAGTGCAAAATGGCTCTCTTAATTTA 

CGGCTATCTGTTCTGTTGGATTGGATATGATTGCCATCCCAGAAGATACGCCTGCTGAAACTATTGCGGCTATGAT 
TGCGGATGAAGCAGCAATCGGTGTTATCAACATGAAAACAACAGCTGTTCGTATCATTCCCAAAGGAAAAGAAGG 
CGATATGATTGAGTTTGGTGGTCTATTAGGAACTGCACCCGTTATGAAGCTrrAATGGGGCTTCGTCT 
ATCTCrCGCGGTGGACAAATCCCAGCACCAj\TTCATAGTTTTAAAAATTAA 

4128.1 

ATGACACAG ATT ATTGATGGGAAAGCTITAGCGGCCAAATTGCAGGGGCAGTTGGCTG AAAAG ACTGCAAAA 

AAGGAAGAAACAGGTCTAGTGCCTGGTTTGGTAGTGATTTrGGTTGGGGACAATCCAGCCAGCCAAGTCr 

CGCAACAAGGAGAGGTCAGCCCTTGCGGCrGGTTTCCGTAGCGAAGTAGTACGGGTTCCAGAGACCATTACTCAA 

GAGGAATTGTTAGACCTGATTGCTAAATACAATCAGGATCCAGCTTGGCATGGGATTITGGTTCAGTTGCCATTA^ 

CAAAACACATTGATGAAGAGGCGGTTCTATTGGCTATTGACCCAGAAAAGGATGTGGATGGTTTCCATCCTCTAA 

ACATGGGGCGTCTTTGGTCTGGTCATCCAGTCATGATTCCTTCGACACCGGCAGGAATTATGGAAATGTTCCATGA 

ATATGGGATTGACTTGGAAGGTAAAAATGCAGTCGTCATCGGTCGATCCAATATTGTCGGAAAACCTATGGCCCA 

GCTTCHTTTGGCAAAGAATGCAACAGTAACCTTGACTCACTCA^ 

GCAGATATTCTGGTTGTTGCAATCGGTCGTGCCAAGTTTGTGACTGCTGACTTTGTCAAACCAGGTGCGGTAGTCA 
TTGACGTTGGGATGAACCGCGATGAAAATGGTAAGCTCTGTGGGGATGTTGATTATGAGGCGGTTGCCCCACTTGC 
TAGCCACATTACGCCAGTCCCTGGAGGTGTCGGTCCTATGACCATTACTATGCTGATGGAGCAAACCTATCAGGCA 
GCACTTAGGACATTGGATAGAAAATAA 

4128.2 

ATGTCTAAATTTAATCGTATTCATTTGGTGGTACrGGATTCTGTAGGAATCGGTGCAGCACCAGATGCTAATAACT 

TTGTCAATGCAGGGGTTCCAGATGGAGCTTCTGACACACTGGGACACATTTCAAAAACAGTTGGTTTGAATC 

AAACATGGCTAAAATAGGTCTTGGAAATATTCCTCGTGAAACTCCTCrTAAGACT 

AACTGGATATGCAACAA AATT AGAG GAAG TATCTCTTGGTAAGGATACT 

ACTCAACATTACTGAGCCTTTCGATACITTCTGGAAC^ 

TCAGGACGCAAGGTTATTCGTGAAGCCAACAAACCITATTCAGGAACGGCrGTTATCTATGATTTTGGACC^ 
AG ATGG AAACTGGAGAGTTGATTATCTATACTTCAGCTGACCCTGTTTTGCAGATTGCTGCCCACGAAGACATTAT
TCCITTGGATGAATTGTACCGTATCTGTGAATACGCrCGTTC 

T CATTG CTCGCCCT TATG TAGGTGAACCAGGTAACTTCACTCGTACGGCAAACCGTCGTGACTTGGCTGTATCTCC 
Ainu CCCAACTGTITTGG ATAAATTGAATGAGGCTGGTATCG ATACTTATGCTGTGGGTAAAATCAACGATATC 
TTTAACGGTGCTGGTATCAACCATGACATGGGTCACAACAAGTCAAATAGTCATGGAATTGATACACTATTGAAG
ACTATGGGACTTGCTGAGTTTGAAAAAGGATTCTCATTCACAAACCTAGTTGACTTTGA 

GTCGTAATGCTCACGGTTACCGTGATTGCTTGCATGAGTTTGATGAACGCTTACCTGAAATTATCGCAGCTATGAG 
AGAGAATGACCTTCTCTTGATTACTGCGGACCATGGAAATGACCCAACGTATGCAGGAACGGATCACACTCGGGA 
ATATATTCCATTGTTGGCCTATAGCCCTGCCTTTAAAGGAAA 
CAGCGACTGTTGCCGATAACTTTGGTGTGGAAACTGCTATGATTGGGGAAAGTTTCTTAGATAAATTGGTATAA

4129.2

ATGTTTATTTCCATCAGTGCTGGAATTGTGACATTTT^ 

TAGAAAGGCGCAAA TTACA GGCCAGCAGATGCATGAGGATGTCAAACAGCATCAGGCAAAAGCTGGGACTCCTA 
CAATGGGAGGTTTGGTTTTCnTGATTACTrCTGTTTTGGTTGCl 11 HI 1 ' I ' l CGCCCTATTTAGTAGCCAATTCAGCA

ATAATGTGGGAATGATTTTGTTCATCTTGGTCTTGTATGGCT^ 
CGTAAAATCAATGAGGGGCTTAATCCTAAGCAAAAATTAGCTCT 

TCTAT GAGCGCGGTGG CGATAT CCTGTCTGTCTTTGGTTATCCAGTTCATTTGGGATTTTTCTATA 1 ' IT 1 CI 1 CGCT 
CTTTTCTGGCTAGTCGGTTTTTCAAACGCAGTAAACTTGACAGACGGTGTTGACGGTTTAGCTAGTAT^ 
GATTAGTTTGTCTGCCTATGGAGT TATTG CCTATGTGCAAGGTCAGATGGATATTCTTCTAGTGATTCTTGCCATGA
TTGGTGGTTTGCTCGGTTTCTTCATCTTTAACCATAAGCCTGCCAAGGTCTTTATGGGTGATC 

CTAGG TGGGATGCTGGCAGCTATCTCrATGGCTCTCCACCAAGAATGGACTCTCTTGATTATCGGAATTGTGTATG 
ITTT l G AAACAACTTCTGTT ATG ATGC AAGTC AGTT ATTTC AAACTG ACAGGTGGT AAACGT ATTTTCCGT ATG AC 
G CCTGTA CATCACCATTTTGAGCITGGGGGATTGTCTGGTAAAGGAAATCCTTGGAGCGAGTGGAAGGTTGACTTC 
TTCriUUGGGGAGTGGGACITCrAGCAAGTCTCCTGACCCTAGCAATTTTATATTTGATGTAA

4133.1 

TTGTTTAAGAAAAATAAAGACATTCTTAATATTGCATTGCCAGCTATGGGTGAA^ 
GAATGGTGGACAGTTATTTGGTTGCTCATTTAGGATTGATAGCTATTTCAGGGG 
CACCATTTATCAGGCGATTTTCATCGCrCTGGGAGCTGCTATTTCCAGTGTTATTTCAAAA^
GACCAGTCGAAGTTGGCCTATCATGTGACTGAGGCGTTGAAGATTACCTTACT^ 

GTCC ATCTT CGCrGGGAAAGAGATGATAGGACTTTTGGGGACGGAGAGGGATGTAGCTGAGAGTGGTGGACTGTA 
TCTATCTTTGGTAGGCGGATCGATTGTTCTCTTAGGTTTAATGACT 

AATCC ACGTCTGCCTCTCTATGTT AGTTTTTTATCCAATGCCTTG AATATT C im 1 ' 1 CAAGTCT AGCTATTTTTGTT 
CTGGATATGGGGATAGCT.GGTGTTGCTTGGGGGACAATTGTGTCTCGTTTGGTTGGTCTTGTGATTTTGTGGTCAC
AATTAAAACTGCCTrATGGGAAGCCAACTTTTGGTTTAGATAAGGAACrGTTGAC 

AGAGCGACTTATGATGAGGGCTGGAGATGTAGTGATCATTGCCTTGGTCG rriCl ' l ' rr GGGACGGAGGCAGTTGCT 
GGGAATGCAATCGGAGAAGTCTTGACCCAGTTTAACTATATGCCTGCCTTTGGCGTCGCTACGGCAACGGTCATGC 
TGTTGGCCCGAGCAGTTGGAGAC^ATGATTGGAAAAGAGTTGCT 
GTTCCTCATGTTGCCCCTGTCCTTTAGTATATATGTCTTCGGTC^

TAGCGGTGGAGGCTAGTGTTCTAGTGACACTGTITTCACTACTTGGGACCCCTATGACGACAGGAAC^ 
TACGGCAGTCTGGCAGGGATTAGGAAATGCACGCCTCCCnTTTTATGCGACAAGTATAGGAATGTGGTGTATCCGC 
ATTG GGAC AGGATATCTGATGGGGATTGTGCTTGGTTGGGGCTTGCCTGGTATTTGGGCAGGGTCTCTCTTGGATA 
ATGGTTTTCGCTGGTTATTTCTACGCTATCGTTACCAGCGCTATATGAGCTTGAAAGGATAG 

4135.2 

ATGCAAACTCAAGAAAAACACTCGCAAGCAGCCGTTCTTGGCTTGCAGCAC 

ATCCTGGTTCCCATCATGATTGCGACAGCCCTTGGCTATTCAGCTGAGCAGTTGACCTACCTGATTTCTACAGATA 
- n TCTTCATGTGTGGGGTGGCAACCTTCCTCCAACrCCAACTC 

TGGAGTTGCATTCCAGTCGGTCGCrCCCTTGATTATGATTGGGCAAAGCCATGGTAGTGGCGCTATGTTTGGTGCC
CTTATCGCATCTGGGATTTACGTGGTTCITGTTTCAGGCAT^ 

AACAGGATCTGTTATTACCACGATTGGTITAACCriTGATCCCTGTCGCTATTGGAAATATGGGAA^ 

GAGCCAACTGGTCAAAGTCTCTTCCTTGCAGCTATTACTGTTCTGATTATCCT 

GATTTATCAAGTCTATCTCTATTTTGATTGGTCTGGTTGTTGGAA 
TTCTCrCCrGTTGCGGTAGCTCCACTTGTCCATGTCCCAACrCCACTCTACITT

ATCTATTGTCATGATGTGTATCATCGCAACGGTGTCTATGGTTGAGTCAACTGGTGTTTATCTGGCCTTGTCTGATA 

TCAC AAAG GATCCAATCGACAGCACGCGCCTTCGCAACGGATACCGCGCAGAAGGTTTGGCCGTACTTCTCGGAG 

GAATCTTTAACACCnTCCCTTACACCGGATrTrCACAAAACGTTGGTTTGGTT^^ 

CCTGCCAATCTACTACGCAGCTGGTTTCCTGGTTCTCCnTGGA 
TTCCAAGCT CCGT CCTCGGTGGTGCCATGCTGGTAATGTTTGGTrTTGTATCAATTCAAGGGATGCAAATCCTCGC

CCGTGTTGACT ITGCT AACAATGAACACAACITCCTTATCGCAGCTGTTTCAATCGCTGCAGGTGTC 

AACAGTAATCTCTTTGTCAGCATGCCGACAGCCTTCCAAATGTTCTT 

TCGCTATTGTCCTCAATGCCGTATTAAATCATAAAAAGAAATAA 

4136.2



ATGAAAGATAGAATAAAAGAATATTTACAAGACAAGGGAAAGGTGACTGTTAATGATTTGGCTCAGGCTTTGGGA 
AAAGACAGTTCCAAGGATTTTCGTGAGTTGATTAAAACCTTGTCC^ 

AAGAAGATGGTAGTCTGACATTAGAAATTAAGAAAAAACATGAGATTACCCrCAAGGGGATTTT^ 
AAAATGGCTTTGGCrTTGTTAGTCTGGAAGGCGAGGAGGACGAC Crri n GTAGGGAAAAATGATGTCAACTATGC 
TATTGATGGTGATACCGTCGAGGTAGTGATTAAGAAAGTCGCTGACCGCAATAAGGGAACAGCAGCAGAAGCCAA 
AATTATTGATATCCTAGAACACAGTTTGACAACAGTTGTCGGGCAmAATCGTTCTGGATCAGGAAAAACCTAAGTAT 

gctggctatattcgttcaaaaaatcagaaaatcagtcaaccgatttatgttaagaaaccagccctaaaatta 

GGAACAGAAGTTCTCAAAGTCrri ATCGATAAATACCCAAGCAAGAAACATGAIl"rCriUGTCGCGAGTGTTCTCG 

ATGTAGTGGGACACTCAACGGATGTCGGAATTGATGTTCITGAGGTCTTGGAATCAATGGACATTGTATCCGAG 

TCCAGAAGCTGTTGTTAAGGAAGCAGAAAGTGTGCCTGATGCTCCGTCTCAAAAGGATATGGAAGGTCGTCTGGA 

TCTAAGAGATGAAATTACCTITACCATTGACGGTGCGGATGCCAAGGACTTGGACGATGCAGTGCATATCAAGGC 

TCTGAAAAATGGCAATCTGGAGTTTGGGGTTCACATCGCAGATG n 'l' ClT ATTATGTGACCGAGGGGTCTGCCCTT 

GACAAGGAAGCCCTTAACCGTGCGACTTCTGTTTACGTGACAGACCGAGTGGTGCCAATGCTTCCAGAACGACTA 

TCAAATGGCATCTGCTCTCTCAATCCCCAAGTTGACCGCCTGACCCAGTCTGCTATTAT 

GGAGATTGATAAACATGGTCGTGTGGTCAACTATACCATTACACAAACAGTTATCAAGACCAGTTTTCGTATGACC 

TATAGCGATGTCAATGATATCCTAGCTGGCGATGAAGAAAAGAGAAAAGAATATCATAAAATTGTATCAAGTATC 

GAACTCATGGCCAAGCTTCATGAAACTTTAGAAAACATGCGTGTGAAACGTGGAGCTCTCAATTTTG 

AAGCGAAGATTTTAGTGGATAAACAAGGTAAGCCTGTTGATATCGTTCTTCGGCAGCGTGGTATTGCCGAGCGGA 

TGATTGAGTCTTTTATGTTGATGGCTAATGAAACAGTTGCCGAACATTTCAGCAAGTTGGAT^ 

CGAATTCACGAGGAGCCTAAGGCTGAAAAGGTTCAGAAGTTTATTGATT^ 

ATGGAACTGCCAGTGAGATTAGTCAGGAGGCACTTCAAGACATCATGCGTGCTGTTGAGGGAGAACCTTATGCAG 

ATGTATTGTCCATGATGClTCrrCGCTCrATGCAGCAGGCTCGTTATTCGGAGCACAATCACGGCCACTATGGACT 

AGCTGCTGACTATTATACTCACTTTACCAGTCCAJVTTCGTCGTTATCCAGACCTTCTTGTTCACCGTA 

ATTACGGCCGTTCTAAGGAAATAGCAGAGCATTTTGAACAAGTGATTCCAGAGATTGCGACCCAGTCTTCCAACC 

GTGAACGTCGTGCCATAGAAGCTGAGCGTGAAGTCGAAGCCATGAAAAAGGCTGAGTATATGGAAGAATACGTGG 

GTGAAGAGTATGATGCAGTTGTATCAAGTATrGTCAAATTCGGTCTCTTTGTCGAATTGCCAAACACAGTTGAAGG 

CTTGATTCACATCACTAATCTGCCTGAATTTTATCATTTCAATGAGCGTGATTTGACTC^ 

GGTATCACTTTCCGAGTGGGTCAGCAGATCCGTATCCGTGTTGAAAGAGCGGATAAAATGACTGGAGAGATTGAT 
TTTTCATTCGTACCTAGTGAGTTTGATGTGATTGAAAj^GGCITGAAACAGTCTAGTCGT 

AGTGGCAGAGGGCGTGATTCAAATCGTCGTTCGGATAAGAAGGAAGACAAGAGAAAATCAGGACGCTCAAATGA 

TAAGCGTAAGCATTCACAAAmAAGACAAGAAGAAAAAAGGAAAGAAACCTTTTTACAAGGAAGTAGCT 

GAGCCAAGCATGGCAAAGGGCGAGGGAAAGGTCGTCGCACAAAATAA 

4137.2 

ATGGGCACAACAGGATTTACAATAATTGACTTAATT ATCTT GATTGTTTATTTACTTGCGGTGTTG 

CT ATTTCTCT AAAAAAG AG ATG AAAG G AA AAG AGTTCTTT AAAGG AG ATG GTTCG GTTCCTTG GT ATGTT ACTTCG 

GTATCCAlMU'IU'GCCACAATGCTCAGTCCGATTTCClTCrrGGGACTCGCTGGTAGCTCTTATGCAGGTAGCTGGAT 

TTTATGGTTrGCTCAATTAGGGATGGTAGTAGCTATTCCACTGACAATTCGTTTTATCTTACCT 

TAGACATCGATACGGCATATGATTACTTGGATAAACGTTrrAATTCTAAAGCACrTCGTATTATT^ 

GTTTATTATTTATCAATTGGGACGTATGTCTATCATTATGTACCrCCGAT^ 

TTG AC ATCAATATTTTG ATT A ITTI G ATGGGTGT AGTTGCAATTGTTTATTCTTATACTGGTGGTCTAAAATCCGT A 

TTATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTT^ 

TAAAGGTGGCTTrGGTGCAGTAGCAGAAACATTAGCAAACGGGAAATTCCTTGCTGCAA^ 

TCCTAACTTGCnTCAAACTCCATUri TTI AATTGTGATGGGTTCAGGCn TACAATCTTGTCTTCCTATGCTTCATC 

TCAAGATTTGGTTCAACGTTTTACTACAACACAAAATArTAAGAAACITAATAAG 

TTCTCACTTGCA^CTGCAACAGTCTnrrACnTGATTGGTACAGGCTTGTACGTAT^ 

ATAGTGCAGCTAGCAATATCCCTCAAGACCAAATCTTTATGTACTTTATTGCATACCAGTTACCAGTAGGTATCAC 

AGGTTTGATCTTGGCAGCGATTTATGCAGCATCTCAATCAACTATTTCAACAGGTTTGAACTCT 

TGGACATTGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAATCGTCGTACGAAAATTGCACAATTCGTAT 

CTCTAGCAGTAGGTTTATrCTCAATrGGTGTTTCCATTGTCATGGCTCACTCAGATATTAAATCTGCATACGAATG 

rrCAATAGTrTCATGGGACTTGTACTTGGTCTACrrGGTGGTGTATTTATrCIT 

TAAACAAGGTGCTTATGCAGCGCTGATTGTATCAACCATCGTCATGGTATTTATTAAATACTTCCTTCCTCCA^ 
GCTGTTAGCTACTGGGCATATTCATTGATTrCAATCTCTGTATCAGTAGTTTCAGGTTATATrGTATCT 
TGGAAATAAAGTATCTGCACCTAAj\TATACAACGATTCATGATATTACAGAAATTAAAGCGGATTCAAGTTG 
AGTTCGTCACTAA 

4138.1 

ATGAAATTTAGTAAAAAATATATAGCAGCTGGATCAGCTGTTATCGTATCCTTGAGTCTATGTGCCTATGCACT 

ACCAGCATCGTTCGCAGGAAAATAAGGACAATAATCGTGTCTCTTATGTGGATGGCAGCCAGTCAAGTCAGAAAA 

GTGAAAACTTGACACCAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGCTGAGCAAATTGTAATCAAAATTACAG 

ATCAGGGCTATGTAACGTCACACGGTGACCACTATCATTACTATAATGGGAAAGTTCCTTATGATGCCCTCTTTAG 

TGAAGAACTCTTGATGAAGGATCCAAACTATCAACTTAAAGACGCTGATATTGTCAATGAAGTCAAGGGTGGTT 

TATCATCAAGGTCGATGGAAAATATTATGTCTACCTGAAAGATGCAGCTCATGCTGATAATGTTCGAACTAAAGAT 

GAmAATCAATCGTCAAAAACAAGAACATGTCAAAGATAATGAGAAGGTTAACTCTAATGTTGCTGTAGCAAGGTCT 

CAGGGACGATATACGACAAATGATGGrTATGTCTTTAATCCAGCTGATATTATCGAAGATACGGGTAATGCTTATA 
TCGTTCCTCATGGAGGTCACTATCACTACATTCCCAAAAGCGATTTATCTGCTAGTGAATTAGCAGCACCTAAAGC

ACATCTGGCTGGAAAAAATATGCAACCGAGTCAGTTAAGCTATTCTTCAACAGCTAGTGACAATAACACGCAATC 

TGTAGCAAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAAAATCTCCAGAGTCTTTTGAAGGAACTCT 

TTCACCTAGCGCCCAACGTTACAGTGAATCAGATGGCCTGGTCrTTGACCCTGCTAAGATTATCAGTCGTACACCA 

AATGGAGTTGCGATTCCGCATGGCGACCATTACCACTTTATTCCTTACAGCAAGCTTTCTGCCTTAGAAGAAAA 

TTGCCAGAATGGTGCCTATCAGTGGAACTGGTTCTACAGTTTCTACAAATGCAAAACCTAATGAAGTA 

TCTAGGCAGTCTTTCAAGCAATCCTTCTrCTTTAACGACAAGTAAGGAGCTCTCTTCAGCATCTGATGG 

TTAATCCAAAAGATATCGTTGAAGAAACGGCTACAGCTTATATTGTAAGACATGGTGATCATTTCCATTACATTCC 

AAAATCAAATCAAATTGGGCAACCGACTCTTCCAAACAATAGTCTAGCAACACCTTCTCCATCTCTTC 

CCAGGAACTTCACATGAGAAACATGAAGAAGATGGATACGGATTTGATGCTAATCGTATTATCGCTGAAGATGAA 

TCAGGTTTTGTCATGAGTCACGGAGACCACAATCATTATTTCTTCAAGAAGGACTTGACAGAAGAGCAAATTAAG 

GTG CGC AAAAAC ATTTAG 

4139.1 

ATGAAAAAAAGAGCAATAGTGGCAGTCATTGTACTGCTITTGATTGGGCTGGATCAGTTGGTCAAATCCTATATCG 
TCCAGCAGATTCCACTGGGTGAAGTGCGCrCCTGGATCCCCAATTTCGTTAGCTTGACCTACCTGCAAAATCGAGG 
TGCAGCCIUUM'CTATCTTACAAGATCAGCAGCTGTTATTCGCTGTCATTACTCTGGTTGTCGTGATAGGTGCCATTT 
GGT ATTT ACATAAACACATGGAGGACTCATTCTGGATGGTCTTGGGTTTGACTCTAATAATCGCGGGTGGTCTTGG 
AAACTrTATTGACAGGGTCAGTCAGGGCTTTGTTGTGGATATGTTCCACCTTGACTTTATC 

ATGTGGCAGATAGCTATCTGACGGTTGGAGTGATTATTTTATTGATTGCAATGCTAAAAGAGGAAATAAATGGAA 
ATTAA 

4139.5 

ATGAATACAAATCTTGCAAGTTTTATCGTTGG ACTG ATCATCG ATGAAAACGACCG T TTT T ACTTTGTGCAAAAGG 
ATGGTCAAACCTATGCTCTTGCTAAGGAAGAAGGCCAACATACAGTAGGGGATACGGTCAAAGGTTTTGCATACA
CGGATATGAAGCAAAAACTCCGCCTGACAACCTTAGAAGTGACTGCCACTCAGGACCAATTTGGTTGGGGACGTG 
TCACAGAGGTTCGTAAGGACTTGGGTGTCTTrGTGGATACAGGCCTTCCTGACAAGGAAATCGTTGTGTCACTCGA 
TATTCTCCCTGAGCTCAAGGAACTCTGGCCTAAGAAGGGCGACCAACTCTACATCCGTCTTGAAGTGGATAAGAA 
AGACCGTATCTGGGGCCTCTTGGCTTATCAAGAAGACTTCCAACGTCTTGCTCGTCCTGCCTACAACAACATGCAG 
AACCAAAACTGGCCAGCC ATTGTTT ACCGTCTCAAGCTGTC AGG AAC r IT I GTTTACCT ACCAG AAAAT AATATGC 
TTGGTTTTATTCATCCTAGCGAGCGTTACGCAGAGCCACGTTTGGGGCAAGTATTAGATGCGCGCGTTATTGGTTT 
C CGTG AAGTGGACCGCACTCTGAmACCTCTCCCTCAAACCACGCTCCTTTGAAATGTTGGAAAACGATGCTCAGATG 
A TTTT GACTTATTTGGAAAGCAATGGCGGTTTCATGACCTTAAATGACAAGTCATCTCCAGACGACATCAAGGCAA 
CCTTTGGCATTrCTAAAGGTCAGTTCAAGAAAGCTTTAGGTGGTCTTATGAAGGCTGGTAAAATCAAGC^ 
GTTTGGGACAGAGTTGATTTAG

4139.8 

ATGAAAGATGTTAGTCTA IT I'l 1 ATTGAAAAAAGTTTTCAAAAGCCGCTTAAACTGGATTGTCTTAGCTTTATTTGT 
ATCTGTACTCGGTGTTACCnTTTATTrAAATAGTCAGACTGCAAACTCACACAGCTTGGAGAGCAGGTTGGAAAG^ 
CGCATTGCAGCCAACGAGAGGGCTATCAATGAAAATGAAGAGAAACTCTCCCAAATGTCTGATACCAGCTCGGAG 
GAATACCAGTrrGCTAAAAATAATTTAGACGTGCAAAAAAATCrTTTGACGCG 

TAAAAGAAGGGCGCTGGAAAGAAGCCTACTATTTGCAGTGGCAAGATGAAGAGAAGAATTATGAATTTGTA 
ATGACCCGACTGCTAGCC CTGG CTTAAAAATGGGGGTTGACCGCGAACGGAAGATTTACCAAGCCCTGTATCCCT 
TGAACATAAAAGCACATACTTTGGA GTTTCC GACCCACGGGATTGATCAGATTGTCTGGATTTTAGAGGTTATCAT 
CCCAAGTTTGTTTGTGGTTGCTATTATrriTATGCTAACACAACTATTTGCAGAAAGATATC 

ACAGCTCACTTATATCCTGTTTCAAA AGTGAC ATTTGCAATATCCTCTCTrGGAGTTGGAGTGGGATATGTAACTG 

TGCTGTTTATCGGAATCTGTGGCTTlTCi ri TCTAGTGGGAAGTCTGATAAGTGGTTTTGGACAGTTAGATTATCCC 

TAC CCAA TTTATA GCIT AGTGAATCAAGAAGTAACTATTGGGAAAATACAAGATGTATTATTTCCTGGC 

TA GCTTTCTT AGCCTTTATCGTCATTGTGGAAGTTGTGTACTTGATTGCTTACTTTTTC 

CTCT ri CTT TCACT CATTGGGATTGTTGGCTTATTGTTTGGTATCCAAACCATTCAG 

TCTGATTCCCTTTACITACITGCGTrCAGTGGAGATTTrATCTGGAAGATTACCTAAGCAGA 

TAAATTGGAGCATGGGAATGGTCTTACTTCCTTGCCTGATTATCTTTTTGCTATTGGGAATT 

TGGGGAAGTTCACAGAAAAAAGAATTTTTTAATAGATTCTAG 

4141.1 

ATGATGAAGTTCATATrGGATATTGTTAGTACACCAGCTATTTTAGTAGCTTTAATrGCAATCTTAGGATTAGTTCT 

TCAGAAGAAGAAATTACCT GATA TTATTAAAGGTGGAATrAAGACCTTTGTTGGlM'lCi l AGT^ 

GCAGGAATTGTACAAAA'lTCrrrAAA TCCA TTrGGTACCATGTrTGAGCATGCTrTTCATTTATCT 

GAATAATGAAGCAATTGTAGCT GTAG CTTTAACAACATATGGCTCAGCTACTGCAATGATTATGTTTGCAGGCATG 

GTGTTCAATATCTrAATCGCTCGTTTTACTCGATTTAAATATATTTTTTTAACAGGGCAC 

ATGTATGATTGCGGTCATTTTATCAGTTGCTGGCTTrACTAGCITGCCTCTCATCrTACrAGGAGGATrAGCACTCG 

GTATTATTATGAGT ATTT CCCCAGCATTTGTGCAAAAATATATGGTTCAATTAACTGGAAATGA 

AGGTCATrrCAGTTCTTTGGGATATrGGTTGAGTCGTTTTA 

GAGGACATTAAATrTCC AAAGA GTTTAGCnMU'rrACGTGATAGTACrGTTAGTATTACTTTATCCATGGCAGTTAT 
TTACATTATTGTAGCrATCTTTGCAGGGTCAGAATATATAGAAAAAGAAATCAGTAGTGGTACAAGTGGTCTAGTT 
TATGCTTTACAATTAGCAGGTCAATTTGCAGCAC^
AATTGTTCCAGCCTTTAAAGGTATTTCAGAGCGTCTTGTA 

ATACTTATGCACCCAATGCAGTTCTAATTGGATTTATCTCTAGTTTTGTTGGTGGm 

ATTGCTTCAGGAACGGTTGTTATCTTACCAGGTGTTGTGCCTCA nTClTCTGTGGAGCGACTGCAGGTGTCATTGG 
GAATGCATCTGGTGGTGTTCGTGGAGCCACTATTGGAGCA ITTTTACAAGGTATTTTAATCAGTTTTCTTCCAGTCT 
riTTAATGCCAGTTTTGGGAGGACTTGGTTTCCAAGGATCAACTrrCTCAGATGCAG 
ATTTTAGGAATGTTAAATCAATTTGGCTCACAAGCAGGCATTGTGATTGGTCTTGTTCTTATTCT 
rGGAGTATCCTTTATTAAAAAGCCATCTGCAACGGAGGAATAA 

4142.3 

ATGATTAAAACATTTCTCTCTGCCCTTTCGGTCATTCTCnTTTCTATCCCT 

TCTAATCTTAj\CATTTGGCTATCTACCCAACCTATCTTGGCACAGATTTATGCCTTCCCCITAG 
GGCTGCTATTTTAAGTTTCrTATTTTTTTTCCTATCT^ 

TTTTGCTCTTACTATCGCTCATATTACTATTATTCGGAACAGATAAAACC ClUl ' Cl^ CTGCATCAAA 
ACCTTAAAATTAGTAACTTGGAACGTCGCTAATCAAATAGAAGCACAACATATTGAGCGAAi rT l I AGCCATTTTG 
ACGCCGATATGGCTATATTCCCTGAACTAGCTACCAATATCAGAGGTGAGCAAGAAAACCAGAGAATCAAACTAT 
TGTTTCATCAAGTTGGACTTTCTATGGCCAACrATGATATTTTCACnTCT 

GTGACTGTGATTGTCAAGAAAAGTTATGGTTrCTATACAGAAGCTAAAACriMMCATACAACACGGTTCGGGACAA 

TTGTATTACATTCGAGAAAACAAAATATACCAGATATCATTGCCTTGCATACTGCGCCTCCTCTGCCAGGTTTAAT 

GGAAATCTGGAAGCAAGACTTAAACATCATTCATAATCAATTGGCTTCAAAATATCCAAAGGCTATTATTGCAGGT 

GATTTTAATGCAACTATGCGTCATGGAGCACTTGCAAAAATAAGCTCTCATAGGGACGCATTAAATGCACT 

CCrmGAAAGAGGAACTTGGAATAGCCAAAGTCCAAAACT^ 

AAAACC ACT ACT ATGTT AAAG ATTT AG AC ATrGTAAGTTTTCAAAACTCTG ATCATAG ATGT A l'l'Tl'l AC AG AAAT 
CACATTTTAA 

4142.4 

ATGAATCCAATCCAAAGATCTTGGGCn-ATGTCAGCAGAAAGCGACT 

TGGTCTTATTGGCCGGAATTTCAGCCTGTTTGACTCTGATGAAGTCCAACAAAACAGTAGA 

ATCACTCAATACATCTTTITCTATTAAGAj^GATAGAGAATGGTCAGACATrCAAGTrGTCAGACCrAGCATCT 

AGCAAGATTAAGGGGCTGGAAAATGTCTCTCCTGA^CITGAGACGGTCGCAAAACrAAAAGACAAGGAAGC 

ACTGGCGAGCAGAGCGTGGAGCGTGATGATrrATCAGCTGCAGACAATAACTTGGTTAGCTTAACGGCrCTTGAG 

GATTCATCCAAGGATGTAACCTTTACCAGTTCGGCTTTCAATCTAAAAG 

TCCAAGAAAATCCTTATCCACGAAGAATTGGCTAAGAAGAACGGTCnTTCGCTTCATGACAAGATTGGCT 

CTGGTCAGTCTGAATCTGGAAMAAGGACAAACAGTAGAGTTTGAGATrATCGGCAT Crri ' I CTGGTAAAAAACAAG 

AGAAATTCACAGGCTTGTCTTCTGACTTCAGTGAAAATCAAGTCnTTACAGACTATGAAAGTAGCC 

GGGCAATAGTGAAGCTCAAGTCAGTGCAGCACGCTTCTATGTAGAAAATCCTAAGGAAATGGACGGACrCATGAA 

GCAGGTAGAAAACITGGCCTTGGAAAATCAAGGCTACCAAGTCGAAAAGGAAAACAAGGCTTTTGA^ 

AGACTCAGTTGCAACTTTCCAAACCTTCCTGACCATCITCCnTTATGGGATGTTGAT 

ATTCTGGTTTTGTCTCTCTGGTTGAGAGAACGGGTCTATGAAGTGGGGATTTTACTrGC 

GCTCGATCITCCTACAATrCTGTTTAGAGGTAGTTITGGTATCrCTTGGAGCTTTGCT^ 

AACGCAATCACAACITACCTACTCCAAACTCTACrAGCAAGTGGAGATCAGGCAAGCTTACAAGATACACTAGCC 
AAAGCAAGCAGTTTATCAACTAGCATCITATCITTTGCAGAAT 

TGTAGCCCTTrGTTTCCrATrCTTATTTAGAAAATCACCGAAAGAAATTrTATCATCT 
4142.5 

ATGTTACACAACGCATTTGCCTATGTTACAAGGAAGTTrrTCA^TCGATTGTCATCTTCCT 

GGCGAGCTTGAGTTTGGTCGGCTTGTCAATCAAGGGAGCTACrGCCAAGGCTTCTCAGGAGACCTTTAAAAA 

ACCAATAGCITCTCCATGCAAATCAATCGTCGCGTCAACCAAGGAACGCCTCGTGGTGCTGGGAATATCAAGGGT 

GAAGACATCAAAAAAATCACCGAAAACAAGGCCATTGAGTCnTATGTCAAACGTATCAACGCTATCGGAGATTTG 

ACTGGATATGACCrGATTGAAACGCCAGAAACCAAGAAGAATCTCACTGCTGATCGTGCCAAGCGTTTTGGAAGT 

AGCTTGATGATTACAGGTGTCAATGACTCCTCrAAAGAAGACAAGTTTGTCTCrGGTTC^ 

GAGAGCACTTAACCAACGACGACAAGGATAAAATCCTCTTGCACAAGGACTTGGCAGCCAAACACGGCTGGAAA 

GTAGGGGACAAGGTTAAACTGGACTCTAATATCTACGATGCAGATAATGAAAAAGGAGCCAAGGAAACAGTTGA 

AGTGACAATCAAGGGACTCnTTGATGGTCATAATAAGTCAGCAGTAACCTACTCACAAGAACITTACGAAAACAC 

AG CTATT ACAGACATTCACACTGCTGCAAAACTTTATGGATACACAGAAGACACAGCCATTTATGGGGACGCAAC 

CI i CT l i GTAACAGCAGACAAGAACTTGGATGATGTTATGAAAGAGTTGAATGGCATCAGTGGTATCAACTGGAA 

GAGCTACACACTCGTCAAGAGCTCCTCrAACTACCCAGCTCTTGAGCA^ 

AACCTCCTCTTCrGGGGTAGCTTGAGCTTCTCAGTTCTCCTCCTTGCCCT 

TCGCAAGGAAGTGGGAATTCTCCTCTCTATCGGCCTCAAGCAGGCAAGTATCTTGGGTCA 

ATTCATCACCGAATCTATCrTGATTGCTATCCCTGCrcrAGTTTCTGCrTACTTCCT 

CAATTGGAAACACTGTCCTTGCCAATGT GACT rCAGGTGTTGCCAAACAGGCTAGTAAGGCGGCTCAAGCCTCTA 
ACCTT GGTGG TGGTGC AGAAGTAGATGGCTTTAGCAAGACCTrGTCGAGCCTAGACATTTCCATTCAGACATCAGA 
CnTATCATCATTITTGTCCTTGCCTTGGTrCTAGTGGTTCTCGTTAT^ 
AACAACCAAAAGAGCTCTTGCTGGATGGTGAATAA 
4144.1

ATGTCACAGGATA^CAAATGA^AAGCTGTTTCTCCCCTTCTGCAGCGAGTTATCAATATCTCATCGATTGTC GGTG 
GGGTTGGGAGTTTGATTTTCTGTATTTGGGCTTATCAGGCnXiGG 

ATCCAGCAGGCAGGCATCTGGGGTCCACCTCTCTTTATCrT 1 1 1 ' i'AC AG ATTTTAC AG ACTGTCGTCCCTATCATTCC 
AGGGGCCTTGACCTCGGTGGCTGGGGTCTTTATCTACGGGCACATCATCGGGACTATCTACAACTATATCGGCATC
GTGATrGGCTGTGCCATTATCriMUATCTAGTGCGCCTATACGGAGCTGCCTTTGTCCAGTCTGTCGTCAGCAAGC 
GCACCTACGACAAGTACATCGACTGGCTAGATAAGGGCAATCGTTTTGACCGC II C I 1 rATTTTTATGATGATTTG 
GCCCATTAGCCCAGCTGACTXTCTCTGTATGCTGGCTGCCCTGACCAAGATGAGCTTCAAGCGCTACATGACCATC 
ATCATTCTGACCAAACCCITTACCCTCGTGGTTTATACC™^ 
GCTTTGA

4144.2 

ATGAGAAATATGTGGGTTGTA^TCAAGGAAACCTATCTTCGACATGTCGAGTCATGGAGi'rrcri'C'l'l'l ATGGTGA 
TTTCGCCGTTCCTCTTTTTAGGAATCTCTGTAGGAATTGGGCATCTCCAAGGT^ 

GTGGCAGTAGTGACAACAGTGCCATCTGTAGCAGAAGGACrGAAGAATGTAAATGGTGTTAACTTCGACTATAAA
GACGAAGCAj^GTGCCAAAGAAGCAATTAAAGAAGAAAAATTAAAAGGTTATTTGACCATTGATCAAGAAGATAGT 
GTTCTAAAGGCAGTTTATCATGGCGAAACATCGCTTGAAAATGGAATTAAATTTGAGGTTACAGGTACACT 
AACTG C AAA^TC AGCTT AATCGTTC AACTGCTTCCTTGTCTCA AG AGC AGG AAAAACG CTTAGCG C AG AC AATTC 
AATTCACAGAAAAGATTGATGAAGCCAAGGAAAATAAAAAGTTTATTCAAACAATTGCAGCAGGTGCCTTAGG 

T C ' riU ' ClM ' l ATATGATTCrrGATTACCTATGCGGGTGTAACAGCTCAGGAAGTTGCCAGTGAAAAAGGCACCAAAAT
T ATGG AAGTCGT 1 ' 1" I " I ' I CT AGC AT AAGG GC AAGTC A CT ATTTCT ATGCGCGG ATG ATG G CTCTGTTTCT AGTG ATTT 
TAACGCATATTGGGATCTATGTTGTAGGTGGTCTGGCrGCCGTTTTGCTCTTTAAAGATTT^ 
TCTGGTATTTTGGATCACTTGGGAGATGCTATCTCACTGAATACCT 
CGTAGTCTTGGCAGCCTTCCTAGGATCrATGGTTTCTCGTCCTGAGGACTCAGGGAAAG 

ATTTTGATTATGGGTGGTTTTTTTGGAGTGACAGCTCTAGGTGCAGCTGGTGACAATCTCCTCTTG

TT AT ATTC CCTTT ATTTCG ACCT I C 1 T 1 ATGCCGTTTCG AACG ATTAATG ACTATGCGGGGGG AGCAGAAGCATGG 

ATTTCACTTGCTATTACAGTGATTTTTGCGGTGGTAGCAACAGGATTTATCGGACGCATGTATGCTAGTCT 

TCAAACGGATGATTTAGGGATTTGGAAAACCTrTAAACGTGCCTTATCTTATAAATAG 

4144.3

ATGACAGAA^CCATTAAATTGATGAAGGCTCATACTTCAGTGCGCAGGTTTAAAGAGCAAGAAATTCCCCAA^^ 
GACTTAAATGAGATTTTGACAGCAGCCCAGATGGCATCATCTrGGAAGAATTTCCAATCCrACTCT^ 
TACGAAGTCAAGAGAAGAAJVGATGCCTTGTATGAATTGGTACCTCAAGAAGCCATTCGCCAGTCTGCTGTTTTCCT 
TCTCTTTGTCGGAGATTTGAACCGAGCAGAAAAGGGAGCCCGACTTCATACCGACACCTTCCAACCCCAAGGTGT 

GG AAGGTCTCTTG ATT AGTTCGGTCGATGC AG CTCTTGCTGG AC AAAACGCCT^

TATGGTGGTGTGATTATCGGTTTGGTTCGATACAAGTCTGAAGAAGTGGCAGAGCTCTTTA^CCTACCTGACrACA 
CCTATTCTGTCTTTGGGATGGCACTGGGTGTGCCAAATCAACATCATGATATGAAACCGAGACTGCCACTAGAGA 
ATGTTGTCTTTGAGGAAGAATACCAAGAACAGTCAACTGAGGCAATCCAAGCnTATGACCGTGTTCAGGCTGACT 
ATGCTGGGGCGCGTGCGACCACAAGCrGGAGTCAGCGCCTAGCAGAACAGTTTGGTCAAGCTGAACCAAGCTCAA 

CTAGAAAAAATCTTGAACAGAAGAAATTATTGTAG

4146.1 

ATGTTAAAACTTATTGCTATTGTTGGAACAAATTCAAAACGT^ 
AACACTTTACTGACAAAGCTGAAATTGAACITGTTGAAATCA 
GCAAGTACCrGCTGAAATATTGGAAATTGCTGCTAAAATCGAAGAGGCAGATGGCGTTATTATC
GTATGATCACTCTATTCCAGCTGTTTTGATGAGCGCTCTTGCTTC 

AACCAATCATGATTACAGGTGCTTCTTACGGTACGCTTGGTTCATCTCGTGCCCAATTGCAACTTCGTCA 
GAATGCTCCTGAAATCAAGGCAAATGTTCITCCAGATGAATTCITGCrCTCACACrCT 
AGTGGCGACTTGGTTGACCTTGATGTTATCAAGAAATTGGATGCCATCTTTGATGAC^ 
TCACAGAAAAATTACGTAATGCACAAGAATTACTTCGCAAAGATGCTGAAGACTTTC

4146.2 

ATGAATACCTATCAATTAAATAATGGAGTAGAAATTCCAGTATTGGGATTTGGAACTTTTAAGGCTAAGGATGGA 
GAAGAAGCCTATCGTGCAGTGTrAGAAGCCTTGAAGGCTGGTrATCGTCATATTGATACGGCGGCGATTTAT CAGA 
ATGAAGAAAGTGTTGGTCAAGCAJVTCAAAGATAGCGGAGTTCCACGTGAAGAAATGTTCGTAACTACCAAGCTTT
GGAATAGTCAGCAAACCTATGAGCAAACTCGTCAAGCTTTGGAAAAATCTATAGAAAAACTGG 
TGGATTTGTATTTGATTCATTGGCCGAACCCAAAACCGCTrCAGAGAAAATGACGCA 

AAGTTTGG AGAGCG ATGGAAG ACCTCTATCAAG AAGGG AAAATCCGTGCTATCGGCGTTAGCAA 1" l" ITCT TCCCC 
ATCATTTGGATGCCHTGCTTGAAACrGCAACrATCGT^ 
TCAAGATCAAGTCGTAGCTTACTGTCGTGAAAAGGGAATTTTATTGGAAGCrrGGGGGCCrrrTGGACAAGGAGA
ACTGTTTGATAGCAAGCAAGTCCAAGAAATAGCAGCAAATCACGGAAAATCGGTTGCTCAGATAGCCTTGGCCTG 
G AG CTTGGC AG AAGG ATTTTTACCACTTCC AAA ATCTGTC AC AACCTCTCGT ATTC Aj\GCT 

GGAATTGAACTGAGTCATGAGGAGAGAGAAACCTTAAAAACGATTGCTGTTCAATCGGGTGCTCCACGAGTTGAT 
GATGTGGATTTCTAG

4147.1

ATGAGGTGCAAAATGCTTGATCCAATTGCTATTCA^ 

TGACAGGCTTGATTCTTGCGGTTTATITGACCATGAAAGAAGCACCTAGAAAGAAGA TCArA CCAGACGATAT^ 
AGATTTTATCTTAGTAGCCITTCCCTTGGCTATTTTAGGAGCrrCGTCTCT 

TAGTCAGAATTTAGGAGAGA'riUMl'GCCATTTGGAATGGTGGTTTGGCCATTTACGGTGGTTTGATAACTGGGG^ 
CrTGTGCTCTATATCTTTGCTGACCGTAAACTCATCAATACTTGGGATTTTCT 

GATTGCTCAAAGTTTGGGGCGTTGGGGTAA'l'lMCIHU AACCAAGAAGCTTATGGTGCAACAGTGGATAATCTGGAT 

TATCTACCTGGCTTTATCCGTGACCAGATGTATATrGAGGGGAGCTACCGTCAACCGACTTTCCTTTATGAGTCT 

TATGGAATCTGCnTGGCnTTGCCTTGATTCTGATTTTTAGACGGAAATGGA^ 

GGCCTTTTACTTGATTTGGTATGGTTTCGGTCGTATGGTTATCGAAGGTATGCGAACAGATAGTCTCATGTTCTTCG 

GCTTTrCGAGTGTCCCAATGGCHXjTCAGTTGTCCTTATCGGTCT^ 

GAAGGCCCCTTACTATATTACAGAGGAGGAAAACTAA 

4147.2 

ATGGGTAAATTATCCrCAATCCTTTTAGGAACCGTTTCAGGTGCAGCTCTTGCCT^ 

GCAAACAAGTTTGCAGTCAGGCTCAAGATTTTCTAGATGATTTGAGAGAAGATCCGGAGTATGCCAAGGAGCAAG 
TCTGTGAAj\Aj\CTGACAGAAGTTAAGGAGCAGGCTACAGATTTTGTTCTGAAAACAA 

GTGAAATCACTGTGGACAGTATACTTGCTCAAACTAAATCCTATGCllUUCAAGCGACAGAAGCATCAAAAAATC 

AATTAAATAATCTCAAGGAGCAATGGCAAGAAAAAGCCGAAGCTCTTGATGACTCAGAAGAGATTGTGA^^ 

TAACAGAAGAATAA 

4147.3

ATGAAAACTAAATTGATCTITI GGGGCTCTATGCTCTTTCTCCTCTCCCTCTCCATCCTTCTGACCATTTATCTGGC 

TTGGATTTTCTATCCTATGGAGATTCAGTGGCTAAACTTAACGAATCGAGTCTATCTAAAACC^ 

TACAA rrnCATATCTTGATGAATTATCTGACCAATCCl ITI'AGTCAGGTCTTACAGATGCCTGATnTCGTTCGTC 

AGCAGCTGGTCTGCACCATTTCGCAGTGGTCAAGAATCTCTTTCATTTGGTTCAGCTAGTAGCTCTAGTG 

CC AAGTTTCT ATGTCTTTGTC AAT AG G ATTGTG AAAAAG GAL" 1 1 1 1 I GTCTCTTTATCGAAAAAGTCTCCTGGCTCT 

AGTAGTCTTACCTGTGATGATTGGACTTGGGGGAGTTTTGATTGGTTTTGACCAATTCT^ 

TTCTCTTTGTGGGAGATGATACCTGGOriTTTGATCCAGCCAAGGATCCrGTTAT^ 

TTTCTTCATGCCTTCCTCCTCIH" ri'l" I'GCCCTCTATG AAAACTTCTT^ 

GTGA 

4149.1 

ATGACTTATCATTTTACTGAAGAATACGATATTATTGTAATTGGTGC 

CCGCTAGCCGTATGGGCTGTAAGGTCCTGCTTGCGACCATCAATATTGAAATGCTGGCTTTCATGCCTTGTAATCC 

CTCTATCGGTGGTTCTGCCAAGGGGATTGTCGTGCGTGAAGTCGATGCCCTCGGTGGCGAGATGGCCAAAACCATT 

GACAAGACTTACATCCAGATGAAGATGCTAAACACAGGGAAGGGGCCAGCTGTCCGTGCCCTTCGTGCGCAGGCT 

GACAAGGAACTTTACTCTAAGGAGATGCGCAAGACGGTTGAAAACCAAGAAAATCTGACCCTTCGTCAAACCATG 

ATTGATGAGATTTTGGTGGAAGATGGCAAGGTTGTCGGTGTGCGTACAGCCACCCATCAAGAATATGCTGCTAAG 

GCTGTTATTGTGACGACAGGGACTGCTCTCCGTGGGGAAATTATCATCGGAGACCTCAAGTACTCATCAGGTCCTA 

ACCACAGCTTGGCTTCTATTAACCTAGCTGACAATCTCAA 

AACCCCTCCACGTGTCAAGGC 1 1 C 1 I CTATCAATTACG ATGTGACAGAAATTCAGCCAGG AG ACGAAGTGCCTAAT 

CATTTCTCATACACTTCACGTGATGAGGATTATGTCAAGGACCAAGTACCATGCTGGTTGACCTATACCAATGGTA 

CCAGTCATGAGATTATCCAAAACAACCTCCACCGTGCGCCTATGTTTACAGGTGTGGTCAAGGGAGTGGGGCCTC 

GTTACTGTCCGTCGATTGAAGACAAGATTGTGCGCTTTGCGGACAAGGAj^CGTCACCAACTCTTCCTTGAGCCAGA 

AGGGCGCAATACTGAGGAAGTCTATGTGCAAGGACTTTCAACCAGTCTGCCTGAGGATGTCCAGCGTGACTTGGT 

GCATTCCATCAAAGGTTTGGAAAATGCAGAGATGATGCGGACAGGTTATGCTATTGA 

GTATGATATGGTCTTGCCTCATCAGTTGCGTGCGACTTTGGAAACCAAGA^ 

CAGACAAATGGAACATCAGGTTACGAAGAGGCAGCAGGCCAAGGGATTATCGCGGGTATCAATGCGGCTCTGAA 
AATCCAAGGCAAGCCTGAATTGATTTTGAAGCGCAGTGATGGTTATATCGGGGTGATGATCGACGACTTGGTGAC 
CAAGGGAACCATTGAACCCTACCGTCTCTTGACCAGTCGTGCTGAATACCGTCT 

GATATGCGCTTGACTGAGATGGGACGCGAGATTGGCCTTGTGGACGATGAACGCTGGGCTCGTTTTGAAATCAAG 

AAAAATCAATTTGATAATGAGATGAAGCGCCTAGACAGTATCAAACTCAAGCCAGTCAAGGAAACCAATGCCAAG 

GTTGAGGAGATGGGCTTCAAACCCTTGACCGATGCAGTGACAGCCAAGGAATTCCTTCGCCGTCCAGAAGTTTCTT 

ACCAAGATGTGGTGGCCTrCATCGGACCAGCTGCAGAAGACTTGGATGACAAGATTATCGAATTGATTGAAACAG 

AAATCAAGTATGAAGGCTATATTTCCAAAGCCATGGACCAGGTTGCCAAGATGAAACGCATGGAAGAAAAACGCA 

TTCCGGCCAATATCGACTGGGATGACATTGATTCTATCGCAACCGAAGCCCGTCAGAAGTTCAAACTCATCAATCC 

AGAAACCATCGGCCAAGCCAGCCGTATrrCGGGAGTAAACCCAGCAGATATTTCTATTTTGATGGTGTATCTGGAA 

GGTAAAAATCGTAGTATTTCTAAAACTCTTCAAAAATCAAAATGA 



4149.2 

ATGAAAGTATTAGCTTTTGATACGTCCAGCAAGGCTCTTTCTCTGGCTAT 

AG ACG ACG ATT AAT ATTAAG A A AAATCACAGT ATT ACTCTTATGCCTGCCATCG A [TTTT l 'G ATGGC A AGTTTGG A 
TTGGACACCCAAGGATTTGGACCGAATCGTGGTAGCrGAj\GGGCCGGGTAGCTATACAGGCTTGCGAj\TTGCGGT
AGCAACTGCTAAGACCITAGCTCACACCCTGAACATCGA 

CATCAACAAGAAGGTTTGTTTGTCCCCnTGATGGATGCGCGTCGCAATAATGTTTATGCAGGATTTTATGAA 

CCAAACCTGTCATGGCAGAAGCGCACCTATCTTTTGAAGAGGTGCTAGAAAAAGTCAAGGGTACTAGTCAGGTAA 

CCTTTGTCGGAGAAGTTGGCCCCTTTGTTGAGCAGATTCAAAAACACTTGCCAAGGACTGATTACAA^ 

GCCCAATGCAGCTAATCTAGCT C1 T 1 V GGCCTGGGACAAGGAAGCAGACTCCTTGCATGATTTTGTGCCGAATTAC 

CTCAAACGAGTCGAGGCTGAGGAAAACTGGCTCAAGAACCATACCGAGTCTGGCGAGTCTTACATTAAACGCCTA 

TGA 

4149.3 

ATGATAGAAATCAAGCGAATTCAACAACAGCCTGACCTAGCTCAAGCCATCTACGCTGTTATGGCAGCTGTTTACC 

TAGTCAGTCCITGGACTCTGGAGCAAATCCAAGCAGATCTGTCCCAAGACCAGACTTGGTATGCATTGGCTTATGA 

TGGGGCAGAAGTGATTGGATTTCTAGCTGTGCAGGAGAATCri l riGAAGCAGAAGTCCTGCAAATCGCTGTCAA 

AGGAGCTTATCAGGGTCAGGGGATTGCGTOgCCTTGTTTGCTCAATTGCCGACAGACAAGGAAATTTTCCTCGA 

GTCAGACAGTCAAATCAACGAGCGCAAGCATTTTACAAGAAAGAAAAGATGACAGTTATCGCTGAGCGAAAGGC 

CTACTACCATGACCCAGTCGAGGACGCCATTATCATGAAGAGAGAAATAGATGAAGGATAG 

4152.2 

ATGACAAAACAAGTCTTATTAGTGGATGATGAAGAACACATTCTGAAATTGCTTGACTACCATTTAAGTAAGGAA 

GGCTTTTCTACTCAATTGGTGACAAATGGACGGAAGGCCTTAGCTTTGGCAGAAACAGAACCCTTTGATT^ 

TGCTTGATATCATGTTACCACAATTAGATGGCATGGAAGTTTGTAAGCGGCTGAGAGCCAAAGGCGTCAAAACTC 

CAATTATGATGGTTTCTGCGAAAJVGTGATGAATTTGATAAGGTTTTGGCCTTGGAATTAGGGGCT 

GACCAAGCCTTTTAGCCCTAGAGAATTGCTGGCGCGTGTCAAGGCTGTCCTCAGGCGAACTAAAGGAGAACAAGA 

AGGAGATGATTCAGATAATATCGCTGACGATTCTTGGCTATTTGGGACCrTGAAAGTATACCCTGAGCGTCATGAA 

GTCTACAAGGCGAATAAGTTACTGAGTITGACCCCAAAAGAATTTGAAAGCGATAAAAATCCGrri TTTGAAGTTT 

TCAAAGTTTCGAAAGTAACCGCCCAATAA 

4154.1 

ATGACTACl I I 1 AAAGATGGAll I'l l ATGGGGTGGTGCTGTTGCTGCTCATCAACTTGAAGGTGGATGGCAAGAAG 

GTGGCAAGGGAATTAGTGTTGCrGATGTTATGACTGCTGGTCGTCATGGAGTAGCTCGTGAAATTACrrTTGGGAGT 

TTTAGAGGGTAAATATTATCCAAATCATGAGGCGATAGA 1 I ' l ' l ' l ATCACCGTTATAAAGAAGATATAGCACTTTTT 

GCTGAAATGGGATTCAAGTGCTTCCGTACCTCTATTGCATGGACACGTATCTTTCCAAAAGGTGATGAGTTAGAGC 

CGAATGAAGAAGGATTACAGTTTTATGATAATCT'f'l'l'lGATGAATGCTTAAAGAATGGTATTGAACCTGTCATCAC 

TCTATCTCATTTTGAAATGCCTTATCACrTAGTGACCGAATATGGTGGTTGGAAAAATAGGAAATTGATTGA^ 

TTTGCTCGTTTTGCAGAAGTCGTATTTAAACGTTACAAAGATAAGGTTAAATATTGGATGACTTTCAATC 

ATAATCAAGCGAATTATCAGGAAGATTTTGCACCATTTACTAACTCAGGTATTGTATATGAGGAAGGTGATAATAG 

AGAAGCAATTATGTATCAAGCAGCACATTACGAATTAGTTGCTTCTGCACGAGCTGTAAAAATTGGTCATGAGATr 

AATCCAGATTTTCAAATAGGTTGTATGATTGCGATGTGTCCAATTTATCCAGTTACTTGCAATCCTAAGGATATCTT 

AATGGCAATGAAAGCTATGCAGAAGCGTTATTATTTTGCTGATGTGCATGTTTTAGGTAAATATCCTGAGCATATT 

TTCAAGTATTGGGAACGAAAAGGTATTTCAGTTGATTTTACTGCCCAGGATAAAGAAGATTTA 

TAG ATTACATTGGTTTCAGTT ACTAT ATGTCCTTTTGCT ATCG ACTCTC ATCGTG AAAATAATCCTTATT^ ATT AT 

CTTGAAACAGAAGATTTAGTGAAAAATAATTATGTTAAGGCTTCTGAATGGGAGTGGCAAATTGATCCAGAA 

TTGCGTTATGCGTTAAATTGGTTTACAGACCACTATCACTTACCACT^^ 

AGATCAAGTTGCAGCAGATGGTATGGTACATGATGATTATAGAATTGAATATCTAGGTGCCCATATTCGTGAAATG 
AAAAAGGCTGTAGTTGAAGATGGTGTTGATTrAATGGGTTATACTCCATGGGGATGTAT^ 

GTACCGGTGAAATGCGGAAACGTTATGGCrrTATTTATGTAGATAAAGATGATAATGGGAAGGGAAGTTATAATC 
GTTCCCCGAAAAAATCTTTTGGCTGGTATAAGGAAGTTATTTCATCTAACGGTGAATCAGTAGAATAG 

4154.2 

ATGGATCAACAAAACGGGTTGTTTGG lU - rrcri GAAAACCATGTTATGGGACCAATGGGCAAACTTGCTCAGTTTA 
A AGT AGTACGTGCT ATCACGGCTG C AGGT ATG G CTGCTGT AC CATTT ACT ATTGTAGG ATC AATGTTTTTGGTATT 
CA GTAT TTTGCCACAAGC^TrCTCATTTTGGCCAATTGTGGCAGATATTTTCT 
CACirTACATGGTTGCAAACTATGCGACTATGGGTrCTCT 

ACAAAAATTTATGCAGAGGAAGAAGAACTCAATATGAATCCTCTTAATGGTGCCnTGCrTGCC^ 

TCATGACAGTACCGCAAATCATTTTTGATGGTGGAATGATGAAGACTGTGACAAGTCTAAAAGAAGGTGCAGTAA 

TTGCAGATGGATGGGCAATGGGAAATGTAGTCGCACGTTTTGGGACAACAGGGATTITTACCGCAATCATTATGG 

CAATTGTGACTGTTCTTATTTATCGTATGTGTGTTAAACATAA 

GGAGTTTCTCGTGGATTTACCGCTTTGGTTCCGCGATT^^ 

GTAGCAATGGGAACAGATATTTTTAAAGTCATTGCAATTCCATTTGGTTTTGTATCCAATCT 

TTGGTTTAATGATTATTTATCTAtTGACTCAACTACTTTGGATTGTAGGTATCCACGGTC 

TrTGTTAGTCCAATTGCTCTTGCTAACATGGCTGAAAATGCTGCTGGCGGGCACTTCGCT 

CTAATATGTTTGTAATTGCAGGTGGTrCTGGTGCAACITTAGGACTATGTTTATATATTG C 

GAACAGCITAAAGCAATAGGACGAGCATCTGTAGTTCCAGCCITATTTAATATTAATGAACCATTAAT^ 

TACCTATTATCTATAATCCAGCCITGGCrATACCATTTATTTTAGCACCAATGGTTACT 

GCGAATrCTCTAAACrrrATTAAGCCAATTATCGCACAGGTTCCATGGCCAACTCCAGTAGGGAT^^ 
TAGGGACAGCAGATCirCGAGCTGTATTAGTTGCTCTAGTATGTGCATTTGCAGCATTCCTAGTCTATCTT
ATCCGTGTATATGATCAAAAATTGGTGAAAGAAGAGCAAGGTATCTAA 

4155.1 

ATGAAAAAATTTTATGTAAGTCCAATTTTTCCTATTCTAGTAGGATTGATTG 

TA im ' 1 GTTAATAATAATCTGTTG ACGGTTTT AATTTTG 1 rTCT T TTTGTAGGAGGCTATG T T TTTTT ATTTAAGAA 
ACTGAGAGTGCATTATACAAGGAGTGATGTAGAACAGATACAGTATGTAAACCACCAAGCGGAAGAAAGTTTGAC 
AGCTCrATTGGAACAGATGCCTGTAGGTGTTATGAAATTGAATTTATCTTCT 
TATGCTGAATTGATTTTGACCAAGGAAGATGGTGATTTTGATTTAGAAGCT^ 

TAGGAAATCCGTCTACTTATGCCAAGCnTGGTGAGAAGCGTTATGCTGTTCATATGGATGCriCri CCGGTGTT^ 

GTATTTTGTAGATCTATCCAGGGAACAAGCCATAACAGATGAATTGGTAACAAGTAGACCAGTGATTGGGATTGT 

CTCTGTGGATAATTATGATGATTTGGAGGATGAAACTTCTGAGTCAGATATTAGTCAAATCAATAGTT^ 

AATTTTATATCAGAGTTTTCAGAAAAACACATGATGTTTTCTCGTCGGGTAAGTATGGA^ 

TGACTACACGGTGCTTGAGGGCTTGATGAATGATAAATTTTCTGTTATTGATGCnTTCAG 

AGACAGTTGCCOT'GACCTTAAGTATGGGAiU"lU > CU"I'ATGGCGATGGAAATCATGATGAGATAGGGAj\AGTTGC^ 
TGCTCAATTTCAACTTGGCTGAAGTACGTGGTGGCGACCAGGTGGTTGTTAA 

CAGTTTATTTTGGTGGTGGGTCTGCTGCTTCAATCAAGCGTACACGGACTCGTACGCGCGCTATGATGACAGCT 

TTCAGATAAGATTCGGAGTGTAGATCAGGTTTTTGTAGTCGGTCACAA 

GCTCTAGGTATGCAGTTGTTCGCCAGCAATGTGATTGAAAATAGCTATC 

CAGATATTGAACGAGCTGTTTCATTCATAGAAAAAGAAGGAGTTACGAAGTTGTTGTCTGTTAAGGATC 

GGATGGTGACCAATCGTTCTTTGTTGATTCTTGTAGACCATTCAAAGACAGCCTTAACATTATCAA^ 

TGATTTATTTACCCAAACCATTGTTATTGACCACCATAGAAGGGATCAGGATTTTCCAGATAATGCGGTTATTACT 

TATATCGAAAGTGGTGCAAGTAGTGCCAGTGAGTTGGTAACGGAATTGATTCAGTTCCAGAATTCTAAGAAAAAT 

CGTrTGAGTCGTATGCAAGCAAGTGTCTTGATGGCTGGTATGATGTTGGATACTAAAAATTTCACCTCGCGAGT^ 

CTAGTCGGACATTTGATGTTGCTAGCTATCTCAGAACGCGCGGAAGTGATAGTATTGCTATCCAGGAAATCGCTGC 

GACAGATTTTGAAGAATATCGTGAGGTCAATGAACTTATTTTACAGGGGCGTAAATTAGGTTCAGATGT^ 

GCAGAGGCTAAGGACATGAAATGCTATGATACAGTTGTTATTAGTAAGGCAGCAGATGCCATGTTAGCCATGTCA 

GGTATTGAAGCGAGTTTTGTTCTTGCGAAGAATACACA 

TGAATGTACAACGGATTATGGAAGAGTTAGGCGGTGGAGGCCACTTTAATTTGGCAGCAGCTCAAATTAAAGATG 

TAACCTTGTCAGA^GCAGGTGAAAAACTGACAGAAATTGTATTAAATGAAATGAAGG 

TGA 

4156.1 

ATGAAAGAGAAAAATATGTGGAAAGAATTGTTGAATCGTGCAGGCTGGATTTTGGT C1 1 T T f ACTTGCCGTCCTTT 
TATATCAGGTTCCCCTAGTGGTTACCTCTATTTTGACTTTAAAAGAAGTAGCCCTGCTACAGTC^ 
TGCTGGCCTTTCAATTGTGGTTCTGGCTCTATTTATTATG^ 
CTTTTTrTAGAGCTAAAGATTTGGCACGTTTGGGCITGAGTTATCT 

ATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGATTAATGATATGGTTCAA^ 

TG ATTTCCAG rTTCTTCTl GCTAGCCTTG CTTG CTCCG ATTTGTG AGG AAATCTTGTGTCGTGGG ATTGTTCCTAAA 

ATTTACCITCTrrATTGATTTATGGAGGTATGTCGACAGTTCTATCTTGGACAGCCT 
AATGTCGATCTTGCTTCACATGATTGTTAj\TGGGATTGCTTTC 

C r I " I'l" 1' ACTTGCCGTCC" ITTI ATATCAGGTTCCCCTAGTGGTT ACCTCTATTTTG ACTTTAAAAG AAGT AG CCCTGC 
TACAGTCAGGGCTGATAGTTGCTGGCCnTCAATTGTGGTTCTGGCTCTATTrATTA 

GTTAGCTAGTTTTAAl'l'1 1 rCTTI 1 I 1 1 AGAGCTAAAGATTTGGCACGTTTGGGCTTGAGTTATCTAGTTATTGTCG 

GGTCAAATATACTTGGTTCCATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGATTAATGA 

TATGGTTCAAAATAGTTCGTTG AT TTCCA GT T 1 CTTCI ' 1 GCT AG CCTTGCTTG CTCCG ATTTGTG AGG AAATCTTGT 

GTCGTGGG ATTGTTCCTAAAAAG A ITTI CCG AGGC AAGG AG AACTTGGG ATTTGT AGTCGGT ACG ATTGTGTTTGC 

TTTATTGCATCAACCAAGTAATTTACCTTCTTTATTGATTTATGGAGGTATGTCGACAGTTCT 

ACAAGACCCAACGTTTGGAAATGTCGATCTTGCnTCACATGATTGTTAATGGG 

GTGGTGATTATGAGTCGGACATTAGGAATTTCTGTTTAA 

4156.4 

ATGGATACACAAAAGATTGAAGCGGCTGTAAAAATGATTATCGAGGCTGTAGGAGAGGACGCTAATCGCGAGGGC 
TTGCAGGAAACACCTGCTCGTGTAGCCCGTATGTATCAAGAGATTTTTTCAGGTCTTGGTCAAACAGCAGAGGAAC 
ATTTGTCAAAATCCHTTGAAATTATTGACGATAATATGGTGGTAGA 

ACACCACITCTrGCCATTTTATGGTAGAGCGCACATTGCCTACATTCCAGATGGTCGTGTGGCAGGCrTGT 

CTAGCCCGTACGGTTGAAGTTTATTCGAAAAAACCACAAATTCAAGAACGTTTGAATATCGAAGTGGCCGATGCC 

TTGATGGACTATCTAGGTGCTAAAGGAGCCTTTGTTGTCATTGAGGCGGAACATATGTGTATGAGTATGCGTGGTG 

TTAGAAAACCAGGCACTGCAACCnTGACGACAGTAGCrCGTGGTCTATTTGAAACAGATAAGGATCTCCGTGACC 

AAGCTTATCGTTTAATGGGGCTATAA 

4157.2 
ATGAAAGACTTGlTlTrAAAGAGAAAGCAGGCCTTTCGTAAGG

ACCACTTTGTCTTGTTCCTGCrrGTCCTGTTGGGCTrrCTAGCCT 

GAAAATCATTGGCCTATCCTTTTGTTTGTAGGAATTACGTCTGTITTACT 

TATGGAGGCTCCAGACAAGCTCTTTCTCTTAGTTGGAGAAGAGGAAATTAAGCTCCATCTCAAGCGTCA 
CATTTCCCTAGTCrrirGGCTCTTTGTACAGACCCri'riC'll GCTGTTATTTGCGCCTTTArrrri'AGCAATGGGTTA 
TGGCTTGCCAG lTn l CTGCTCTATGTG Cl ATTGGGGGTAGGAAAATATTTCCACTTTTGTCAAAAGGCCAGCA 
AATTTTTCACTGAAACTGGACTGGACTGGGACTATGTTATTTCTCAAGAAA 

1 T rCTTI GCCCTCTTTACGCAGGTCAAGGG AATTTC AAACAGCGTT AAGCGTCGTGCCTATCTGG ACTTTATTITAA 
AGGCTGTTCAGAAGGTGCCTGGGAAGATTTGGCAAAATCTCTATCTGCGTTCITATCTGCGAAATGGCGA 
TGCTCTCAGTCITCGTCTTCTCTTGCTTTCCTTGCTGGCGCAGG 1 ri ' l r ATCGAGCAAGCTTGGATTGCGACAGCAG 
TGGTAGTTCTCTTTAACTACCTCTTGCT 

CAACTCTTTCCGCTGGACAAGGGGCAAAAGGAAAAAGGCTTACAGGAGGTAGTTCGAGGATTGACCAGTTTTG^ 

TTACTTGTGGAATTAGTTGTTGGGTTGATTACCTTCCAAGAAAAACTAGCCCTTCTAGCC^ 

TGGTTTTACrAGTCrTGTATTTGCCTTATCAGGTAAAACGTCAGATGCAGGACTAA 

4158.1 

ATGAGAAAATCAATAGTATTAGCGGCAGATAATGCCTATCTTATTCCTTTAGAGACGACTATAAAGTCTG 

ATCACAATAGAGATGTTGATTTTTATATTCTCAACAGTGATATAGCTCCTGAATGGTTTAAATTATTGGGGAGAAA 

AATGGAAGTTGTGAATTCTACAATTCGCAGTGTACACATTGATAAAGAACTTTTTGAAAGCT 

CATATAAATTATGCri'Cn ACTTTAGA IM^rrrrGCGACAGAAGTGGTTGAATCTGATAGGGTATTGTATCTGGATTC 

CGATATCATTGTAACTGGGGAACTAGCTACTTTGTTTGAGATAGATCTCAAAGGATATTCAATTGGTGCTGTTGAT 

GATGTCTATGCCTATGAAGGACGAAAATCTGGATTTAATACTGGTATGTTACTAATGGATGTTGCAAAGTGGAAAG 

AACATTCTATTGTCAATAGTTTATTGGAATTAGCGGCCGAGCAGAATCAAGTrGTTCATCTTGGGGATCAGAGTAT 

TTTAAATATTTATTTTGAGGATAATTGGCTAGCCTTAGATAAAACATATAATTATATGGTGGGTAT^ 

ACCTTGCTCAAGAATGTGAACGTCTAGATGACAATCCACCTACAATTGTTCACTATGCTAGTCATGATAAA CCTTG 

GAATACATATAGTATATCTAGACTACGTGAATTATGGTGGGTTTATAGAGATTTGGATTGGTCAGAGATTGCTTTT 

CAACGTTCCGATTTAAATTATTTTGAAAGAAGCAATCAGTCTAAAAAACAAGTGATGCT^ 

GATATAAAACATTTAGAGTATTTAGTACAACGGTTACCTGATTGGCATTTTCATTTGGCTGC^ 

TGAGGAGCTGACCTCTCTATCACAGTATACGAATGTAACAGTATATCAAAATGTATTACATAGTAGAATTGATTGG 
CTATTGGACGATTCTATAGTTTATTTAGATATTAATACAGGTGGAGAGGl'ri Tl AATGTAGTTACAAGGG CACAAG 
AAAGTGGCAAGAAAATCTTCGCrri'lGATATCACACGTAAAAGTATGGATGATGGACTCTATGACGGTAl l 111 1C 
TGTGGAGAGACCAGATGATTTAGTGGATAGAATGAAGAATATAGAGATAGAGTAA 

4158.2 

ATGACTAAGATTTATTCGTCAATAGCAGTAAAAAAAGGACTATTTACCTC 

GAAGTCGTATTATTCTCC C 1 1 1 1 GTTGACCTAAATACTAAAGA T TTTT T AGG AGGTTCAACAGCCTATCTAGCCTTC 

TCAGCCGCCCTAACAGGTGGGAATCTAAGAAGTTTATCAATTTTra 

TGATTTTATGGCAGATGTTTTCTTTTTCTAAACGGTTGGGTTTAACAT 

AAAATGTACCTGACCTTGCTAATTGCTGTGATTCAATCCTTGGCAGTTAGCTTGAGACTGCCAGTACAATCCTCCT 
ATTCTGCAATATTGGTTGTTCT AATG AATACAATATTGCTG AT AGCAGG AACA nTTn L 1 I GTTTGGTTGTCAGAT 
rrAA ATGCGAGTATGGGGATTGGAGGTTCTATTGTAATCCTCCTATCCAGTATGGTTTTAAATATrCCTC AGGAT G 
TTTTGGAAACATTTCAGACAGTACACATTCCAACAGGGATTATTGTGTTACTTGCTTTATTAACCC 
TATTTACITGCCCTTATGTATCGAGCTCCCTATITGGTTCCT 

CTATTCTTATCTCG AAATCATGTTG AATCCTGCAGGTGGGATGCCTTATATGTATGTG ATGAG nTTCTI AGTGTAC 

CAGCTTATTTGTTCATCTTGTTGGGATTTATTTTCCCTAATCATTCAGGGTTA 

GTTGGAAAGCCTTTGTGGGTCTATGTTTATATTTCGGTCTTATTTTTATTTAGTATCAT^ 

AATGGAGAAGAGATTGCAGACCGTATGAAAAAATCTGGAGAATACATTTATGGTATTTATCCAGGTGCGGATACT 

AGTCGATTTATTAATCGATTGGTCCTTCGTTTCTCAGTCATAGGTGGTCTCTTTAATGTGATTATGGCAGG 

CAT GCTTT TTGTTTTGTTTGATGAAAAGTTATTACGATTGGCAATGATTCCAGG 

TGATTTTTACGATTAGAGACGAGGTCAAGGCTTTAAGGCTAAATGAGACCTATAGACCTTTGATTTAG 
4158.3 

ATGTCCTCTCTTTCGGATCAAGAA.TTAGTAGCTAAAACAGTAGAGTTTCGTCAGCGTCTTTCCGAGGGAGAAAGTC 
TAGACGATATTTTGGTTGAAGCTTTTGCTGTGGTGCGTGAAGCAGA 

TGTTCAAGTCATGGGAGCTATTGTCATGCACTATGGAAATGTTGCTGAGATGAATACGGGGGAAGGTAAGACCTT 
G AC AGCTACCATGCCTGTCTATTTG AACGC l'l TT1 C AGG AG AAGG AGTG ATGGTTGTG ACTCCTAATG AGTATTTA 
TCAAAGCGTGATGCCGAGGAAATGGGTCAAGTTTATCGTTTTCTAGGATTGACCATTGGTGTACCATTTACGGAAG 
ATCCAA AGAAG GAGATGAAAGCTGAAGAAAAGAAGCTTATCTATGCITCGGATATCATCTACACAACCAATAGTA 
ATTTAGGTTTTG ATTATCTAAATG AT A ACCT AGCCTCG AATG AAG AAGGTAAG 1' l'l "l T ACG ACCGTTTAACT ATGT 
GATTATTGATGAAATrGATGATATCTTGCTTGATAGTGCACAAACT 

AGTCTAATTACTATGCGATCATTGATACACTTGTAACAACCrTGGTCGAAGGAGAGGATTATATCITTAAAGAGGA 

GAAAGAGGAGGTTTGGCTCACrACTAAGGGGGCCAAGTCTGCTGAGAATTTCCTAGGGATTGATAATTTATAC^ 

GGAAGAGCATGCGTCTTTTGCTCGTCATTrGGTrTATGCGATTCGAGCTCATAAGCTCTTrACT 

TATATCATTCGTGGAAATGAGATGGTACrGGTTGATAAGGGAACAGGGCGTCTAATGGAAATGACTAAACTTCAA 

GGAGGTCTCCATCAGGCTATTGAAGCCAAGGAACATGTCAAATTATCTCCTGAGACGCGGGCTATGGCCTCGATC 
ACCTATCAGAGTCTTTTTAj\GATGTTTAATAAGATATCTGGTATGACAGGGACAGGTAAGGTCGCGGAAAAAGAG
TTTATTGAAACTTACAATATGTCTGTAGTACGCATTCCAACCAATCGTCCGAGACAACGGA 

TTGACTATCCAGATAATCTATAT ATCA CTTTACCTGAAAAAGTGTATGCATCCTTGGAGTACATCAAGCAATACCA 

TGCTAAGGG AAATCCTTT ACTCG l'l'I'l'l GT AGGCTCAGTTG AAATGTCTC AACTCTATTCGTCTCTCTTGTTTCGTG 

AAGGGATTGCCCATAATGTCCTAAATGCTAATAATGCGGCGCGTGAGGCTCAGATTATCTCCGAGTCAGGTCAGA 

TGGGGGCTGTGACAGTGGCTACCTCTATGGCAGGACGTGGTACGGATATCAAGCTTGGTAAAGGAGTCGCAGAGC 

TTGGGGGCTTGATTGTTATTGGGACTGAGCGGATGGAAAGTCAGCGGATCGACCTACAAATTCGTGGCCGTTCT 

TCGTCAGGGAGATCCTGGTATGAGTAAATTTTTTGTATCCITAGAGGAT 

TGGGTGCATAAAj^GTACAAAGACTATCAGGTTCAAGATATGACTCAACCGGAAGTATTGAAAGGTCGTAAATAC 

CGGAAACTAGTCGAAAAGGCTCAGCATGCCAGTGATAGTGCTGGACGTTCAGCACGTCGTCAGACTCTGGAGTAT 

GCTGAAAGTATGAATATACAACGGGATATAGTCTATAAAGAGAGAAATCGTCTAATAGATGGTTCTCGTGACTTA 

GAGGAT GTTGT TGTGGATATCATTGAGAGATATACAGAAGAGGTAGCGGCTGATCACTATGCTAGTCGTGAATTAT 

TGTTTCACTTTATTGTGACCAATATTAGTTTTCATGTTAAAGAGGTTCCAGATTATATAGATGTAACT 

GCAGTTCGTAGCTTTATGAuAGCAGGTGATTGATAAAGAACnTTCrGAAAAGAAAGAATTA 

TATATGAACAG I'l l 1 I ACGACTTTCACTGCTTAAAGCCATTGATGACAACTGGGTAGAGCAGGTAGACTATCTACA 
ACAGCTATCCATGGCTATCGGTGGTCAATCTGCTAGTCAGAAAAATCCAATCGTAGAGTACTATCAAGAAGCCTA 
CGCGGGCTTTGAAGCTATGAAAGAACAGATTCATGCGGATATGGTGCGTAATCTCCTGATGGGGCTGGTTGAGGT 
CACTCCAAAAGGTGAAATCGTGACTCATTTTCCATAA 

4158.4 

ATGATAGGGACrrrCGCCGCTGCTCTTGTAGCTGTACTAGCAAATTTCATCGTCCCTATTGAAATTAC 

GTGCCAATACTGAAATTGCACCACCAGATGGGATTGGGCAGGTTCTCAGCAACCTCTTGCrCAAACTGGTTGACA 

ACCCAGTCAACGCCCTGCTTACTGCTAACTATATTAGAATCTTATCTTGGGCAGTCATTTT^ 

AGAAGCCAGTAA^\AATAGTCAAGAATTGCTAAAAACTATCGCTGACGTGACT 

CAATCTG GCTC CATTrGGAATCCTTGGTCTTGTTTTTAAAACCAriTCT 

TACGGTATTTTATTGGTTCTATTAGTAACGACTATGCTTTTTG 

TTTATGAGACGCAATCCTTACCCTCTAGTTTGGAACTGCCTCCGTGTCAGCGGTGTGACAGCCTTTTTCACT 

GTTCTGCGACTAACATTCCTGTCAACATGAAACTCrGCCATGACCTTGGACTCAACCCAGATACCTATTCT 

TATCCCACTCGGTTCTACTATCAuATATGGCTGGAGTAGCGATTACCATTAACCTTTTGACCCTTGCTGCAGTTAAC 

ACTCTTGGAATTCCTGTTGACTTTGCCACAGCCTITGTCCTCAGTGTGGTAGCAGCTATCTCATCCTGTGA 

AGGTATTGCCGGAGGTTCCCTCCTTCTTATCC^ 

AAATTGTTGGGGTTGGTTTTGTGATTGGTGTCATCCAAGACTC^ 

CCTCTTTACCGCCGTTGCCGAATACGCAGCAACCCGTAAAAAATAA 

4158.5 

ATGTCTATTAGCCAACGTACGACCAAGCTCATCTTAGCTACCTGTCTTGCC 
TTCGTCAGCAGTTTCT 

CGCAATCGTC l 1 I l'l lCTATGCTTCTAGCTCTGGCTATCGGTGTTCTAGClUU'lCACXrGAGCGGATTTCATATCTG 
GAG TCTCG GCCTCTATCTGGCCTTCTACGTTCCTTTACCCT^ 

ACTGTTTTGGTTAGCCATCTCTTGGTTCAAGAGTCAACCTCTCCAGACCTTCTAGTCAATGAATT^ 
TATTGGTACAGGATTTGCCTTGCTTGTTAATCTCTA 

CTGGTGGAAGAAAAGTTAAAAGATATCCTCCAGCGCTTCAAATACTATTTATCCAGAGGAGACGGACGCAACCGA 

G CACA GCTGGTAGC AG AATTAG ACACGC till G AAAG AAGCCCTCAG ACTGGTCT ATTTGG ATCACTCTGACCACC 

TCTTTCACCAGACAGACTACCATATCCACTACTTTGAGATGAGACAGCGACAAAGTCGTATCCTGAGAAACATGG 

CCCAACAGATTAACACTTGTCACCrTGCCGCCAGTGAAAGCCTGATCTTAGCGCAACTCTTTT 

TCAACTGAGCCAGACCAATCCTGCTTCTGATTTGCTAGATGAAATTGAACGTTATCTGGAAGTCTr 

AGTCTGCCCAAGACAAGAGAAGAATTTGAAACCCGCGCCACCCTTCTTCAmACTCCTACGTGAAGCCAAAAC 

ATCCAAGTAAAAGTTGATTTTTACCAAAAATATAGACAGTAA 

4158.6 

ATGGAAATCATGTCGCTTGCGATTGCTGTTTTTGCCGTCATCATTGGTTTAGTCATTGGATATGTCAGCATCTC 

TAAGATGAAATCATCTCAGGAAGCrGCAGAGTTGATGCTTTTAAATGCT 

ACAAGCTGAGCGTGAAGCGGATTTACTTGTTAATGAAGCCAAACGTGAAAGCAAGTCT 

ATTGGAGGCCAAAGAAGAAGCCAGAAAATACCGTGAAGAAGTGGACGCTGAATTCAAATCAGAACGTCAAGAAC 
TCAA^CAAATCGAAAGTCGTTTGACAGAGAGAGCTACTAGCCTTGACCGTAAGGACGACAATTTGACGAGTAAAG 
AACAAACACTTGAACAAAAAGAACAAAGTATTTCTGATAGAGCGAAAAA^ 

AGGAAGTCGAAAGACAAAAAGAAGCAGAACTAGAGCGTATTGGTGCGCTGTCTCAGGCAGAAGCACGAGATATT 

ATCrTGGCTCAGACAGAGGAAAACTTGACCAGGGAGATTGCCAGTCGCATTCGCGAAGCTGAGCAAGAGGTCAAG 

GAACGTTCTGACAAAATGGCCAAGGACATCTTGGTTCAAGCTATGCAACGTATCGCTGGTGAATATGTAGCGGAG 

TCAACAAACTCAACAGTTCATCTGCCAGACGATACTATGAAGGGACGCATTATTGGTCGTGAAGGTCGTAACATT 

CGTACCTTTGAAAGTTTGACAGGGGTCGATGTGATTATCGACGATACACCAGAAGTGGTGACCITGTCAGGATTTG 

ATCCGATTCGTCGTGAGATTGCCCGTATGACTATGGAAATGTTGCTCAAAGATGGTCGTATACATCCAGCTCGTAT 

CGAAGAGTTGGTTGAGAAAAACCGTCAAGAGATTGACAATAAGATTCGTGAATACGGTGAGGCTGCTGCCTATGA 

AATT GGTG CGCCAAACCTTCATCCAGACTTGATGAAGATTATGGGACGTTTGCAGTTCCGTACTTCATATGGACAA 

AATGTTTTGCGCCATTCGATTGAGGTTGCTAAGTTGGCTGGTATCATGGCGAGCGAACTTGGTGAAAATGCGGCTC 
TTGCCCGTCGTGCTGGATTCCTTCACGATA.TCGGGAAAGCCATTGACCATGAGGTTGAAGGTAGCCACGTTGAAAT
CGGTATGGAATTGGCCCGTAAGTACAAGGAACCCCCAGTTGTGGTGAATACGATTGCTAGTCACCACGGAGATGT 
TGAAGCTGAGAGCGTGATAGCAGTTATCGTCGCTGCAGCAGATGCCTTGAGCGCAGCCCGTCCAGGTGCTCGTAG 
TGAGTCTCTTGAAj\GCTACATCAAGCGTCTCCATGATT^^ 
AGCTTTGCCCTTCAAGCAGGACGTGAAATTCGTATCATGGTCAATCCAGGAAAAATCAAGGACGACAAAGTCACA
ATCTTGGCTCACAAAGTTCGTAAGAAAATTGAAAACAATCTCGATTATCCAGGAAATATCAAGGTAACCGTGAT^ 
CGCGAGCTTCGTGCAGTAGATTATGCTAAATAA 

4158.7 

ATGATGTTAAAACCCTCTATTGATACCTTGCrCGACAAGGTTCCTTCAAJVATATTCACT

AACGTGCCCACGAATTGGAAGCAGGTGCCCCAGCAACTCAAGGTTTCAAGTCTGAAAAATCAACTCTTC 
TAGAAGAAATCGAATCAGGAAACGTTACA^TTCACCCAGATCCAGAAGGAAAACGTGAAGCAGTGCGTCGCCGTA 
TCG AAG AAG AAAAACG CCG C AAAG AA G AAG AAG AAAAG AAAATC AAAG AG C AAATTG CT AAAG AAAAAG AAG A 
TGGTGAAAAAATTTAA 


4161.1 

ATGTCAGCATATCAATTACCGACCGTATGGCAGGATGAAGCTAGTAATCAAGGAGCT TTTAC GGGGCTAAACAGA 
CCAACAGCAGGTGCCCGTTTCGAACAAAACTTGCCAAAAGGAGAACAAGCITTTCAGC^ 
CCAAATGGTGTGAAGGTTACTATCTTATTGCAAGAATTACT^ 
ATAJVGATTGCTATCATGGATGGGGATCAATTCGGATCAGACTTTGTGAAGCTCAATC CAAAT TCCAAGATTCCAGC
CTTATTGGACCAGTCAGGTACTGAAAACGTAAGAGTCTTTGAGTCTGCTCATATTCri CTTT ACCTTGCTGAGAAA 
TTTGGAGCCnTTTTACCAAGTAATCCTGTGGAAAA 

CACCTTTrCTAGGTGGGGGATTTGGACATTTCTTCAATTATGCTCCTGAAAA^ 

ACGATGGAAGTGAAACGCCAGTTGGATTTATTGGATAAGGAATTGGCTCAGAAACC TTATA TTGCAGGCAATGAC 
TATACGATTGCAGATATTGCTATCTGGTCTTGGTATGGACAGTTAGTTCAAGGAAATCTTTACCAAGGTTCTGC
AATTCITGGATGCCTCAAGTTATCAAAATCTAGTAAAATGGG 
GTGGCTTGGAAGTAACTTATACAGAAATTAAATAG 

4161.2 

TTGGCAAGCTTGATCACTTCTATCATCATGTTCTATGTCGGTTTCGATGTTCTAAGAGATACCATTCAAAA
CAGTCGGGAAGAAACGGTCATTGATCCTCTTGGTGCAACTCTAGGAAT^ 
TATCTCTACAj\TACTCGCCTCAGTAAGAAATCCAACTCCAATGCGCTC 
ACGCTGTTACCTCACTTGGAACCGCCATTGCCATCCTAGCTAGTAGTTTC 

TGCTATCATCATCA CrnCl Tl ATCTTGAAGACTGCCTATGATATCTTCATCGAGTCTTCCTTTAGTCTTTCAGATG 
GCTTTGACGACCGCCTGCTCGAGGACTACCAAAAGGCTATCATGGAAATTCCCAAAATCAGCAAGGTCAAATCGC
AAAGAGGTCGCACCTACGGTAGCAACATCTACCTGGATATTACACTAGAG ATG AATCCTGACTTGTCTG IT1 1 iGA 
AAGCCATGAAATCGCGGATCAGGTCGAGTCTATGCTGGAGGAGCGTTTTGGCGTCTTTGATACCGATGTCCATATC 
GAACCAGCACCTATCCCTGAGGATGAj\ATTTTAGACAATGTCTATAAAA^ 

ACCAAGGAAACCAj\CTAGAAGAACTCTTGACTGATGATTTTGTCTATATTCGCCAAGATGGAGAGCAGATGGATA 
AAGAGGCTTATAAGACCAAAAAAGAGTTAAATTCTGCTATCAAGGACATTCAAATTACnTCCAT

CCAAACTCATCTGCTATGAGTTAGATGGTATCATCCATACCAGTATCTGGCGTCGCCACGAAACCTGGCAAAATAT
CTTTCATCAAGAAACCAAAAAAGAATAG 

4162.1 

ATGACAATTAAACTAGTAGCAACGGATATGGACGGAJVCCTTCCTAGATGGGAATGGACGCTTTGATATGGATCGT
CTCAAGTCTCTCTTGGTTTCCTACAAGGAAAAAGGGATTTACn 

TAGAAAAATTATTTGCTGGTGTTCGTGATGACATTATTTTCATCGCGGAAAATGGCAGTTTGGTA 
TCAGG ACTTGTATG AAGCG ACT ATGTCTCGTG AC 1111 ATCTGGC AAC 1" I 1 I G AAAAGCTG AA AACTTCACCTTAT 
GTAGATATCAATAAACTGCTCTTGACGGGTAAGAAGGGTTCATATGTTCTAGATACGGTTGATGAG ACCTA TTTGA 
AAGTGAGTCAGCACTATAATGAjVAATATCCAAAAAGTAGCGAGTTTGGAAGATATCACAGATGACATTTTCAAAT
TTACAj\CCAACTTCACAGAAGAj^CGCTGGAAGATGGGGAGGCTTGG 

ccatgacaactggctttgajvtccattgatattgttctg 

taaaa aacttggtatcacaatgg atcaggtcatggc it itgg ag acaatcttaatg acttacatatg atgcaggtt 
gtgggacatcctgtagctcctgaaaatgcacgacctgajvattttagaattagcaaagactgtgattggtcacca 
aggaacggtcggttatagcttatatggagggcttataa

4162.2 

ATGGCAGATATAAAATTGATTGCATTGGACnTGGACGGGACC^ 

ACCAAGGAAACCTTGCAAGCTGCGCGTGATCGTGGTATCAAGGTCGTATTGACAACTGGTCGTCCCnTAAAAGCC 
ATGGATTTCTTTCTCCATGAGTTAGGGACTGACGGTCAGGAAGATGAGTATACCATTACTTTTAATGGTGGATTAG
TTCAGAAAAATACAGGAGAAATCCTTGATAAAACAGTCTTTTCATATGATGATGTGGCACGTTTGTATGAAGAAAC 
AGAGAAATTATCACTGCCTCTTGATGCCATCTCAGAAGGAACAGTTTATCAAATCCAATCGGACCAAGAAAGTCT 
TrATGCCAAATTCAATCCAGCTTTGACCTTTGTTCCAGTGGACTTTGAAGACTTATCTAGTCAA 
AAATGCGTGACTGCCTTTGCTCAAGAACCCTTGGATGCAGCCATTCAGAAGATTTCTCCAGAATTGT^ 
ATGAAj\TCTTTAAATCACGTGAAATGTTGCTAGAATGGTCACCAAAGAATGTTCATAAAGCAACAGGTTTGG
AACTAATCAGCCATCTTGGAATCGACCAAAGTCAAGTGATGGCTTGTGGTGACGAGGCCAATGACCTCrCTATGA
TTGAATGGGCAGGTCTTGGTGTTGCTATGCAAAACGCTGTTCCTGAAGTAAAGGCAGCCGCAAATGTAGTGACGC 
CGATGACCAACGATGAGGAAGCTGTCGCCTGGGCTATCGAAGAATATGTGCTAAAGGAGAACTAA 

4164.2 

ATGGAAAGTTTACTTATTCTATTATTAATTGCCAATCTAGCTGGTCTCTT^ 

AGGAGAAACACTrAAGTAAGAGCTTGGAGGATCAGGCAGATCATTTGTCAGACCAGTTGGATTACCGCTTTGACC 

AAGCCAGACAAGCCAGCCAGTTAGACCAAAAAGATTTGGAAGTGGTTGTCAGCGACCGTTTGCAAGAAGTGCGGA 

TTGAATTGCACCAJ^GGTCTGACCCAAGTCCGTCAAGAAATGACAGATAATCTCCTCCAAACTAGAGACAAGACAG 

ACCAACGTCTCCAAGCCTTGCAGGAATCAAATGAGCAACGTTTGGAACAAATGCGCCAGACGGTCGAGGAAAAAC 

TAGAAAAGACCTTGCAGACACGCTTACAGGCTTCCTTTGAGACAGTTTCTAAACAACTGG 

CCTTGGAGAAATGCAGACAGTTGCCCGTGATGTCGGAGCTCTTAACAAGGTTCTCTCTGGAACCAAGACGCGAGG 

GATTCTGGGAGAATTGCAACTGGGGCAAATTATTGAAGACATCATGACACCTGCCCAGTACGAACGAGAATACGC 

AACGGTTGAAAACTCTAGTGAACGAGTGGAGTATGCCATCAAGTTACCCGGACAAGGCGACCAAGAATACGTCTA 

TCTGCCAATTGACTCTAAGTTTCCACTGGCAGATTATTACCGCTTGGAAGAAGCCTATGAGACAGGTGACAAGGAT 

GAGATTGAACGCTGTCGTAAGTCACTCCTAGCAAGCGTCAAGCGCTTTGCTAGGGATATTAGGAACAAGTACATA 

GCACCACCTCGGACGACCAATTTTGGAGTTTTGTrrGTTCCGACAGAAGGTCTCTACTCAGAAAT 

CGGTCI'ICi'riGATGATTTGAGACGGGAAGAACAGATTATTGTTGCAGGACCAAGTACCCTATCAGCCCTTCnTAA 

CTCCCTATCAGTTGGTITCAAGACCCTTAATATCCAAAAGAGTGCCGACCATATCAGCAAGACTCTTGCCAGTGTC 

AAGACCGAGTTTGGCAAGTTTGGTGGTATTCTGGTCAAGGCACAAAAACATCTCCAACATGCCTCTGGCAATATTG 

ATGAATTATTAAACCGTCGTACCATAGCTATCGAGCGGACGCTCCGTCACATTGAGTTGTCAGAAGGTGAGCCTGC 

GCTTGATCTACTCCATTTTCAAGAAAATGAGGAAGAATATGAAGATTAG 

4164.3 

ATGAAGATTAGTCACATGAAAAAAGATGAGTTATTTGAAGGCTTTTACCTAATCAAATCAGCTGA 

ACTCGAGCTGGGAj\AAACTACCTAGCCTTTACCTTCCAAGATGATAGTGGCGAGATTGATGGGAAGCTCTGGGAT 

GCCCAACCTCATAACATTGAGGCCTTTACCGCAGGTAAGGTTGTCCACATGAAAGGACGCCGAGAAGTTTATAAC 

AATACCCCTCAAGTCAATCAAATTACTCTCCGCCTGCCTCAAGCTGGTGAACCCAATGACCCAGCTGATTTCAAGG 

TCAAGTCACCAGTTGATGTCAAGGAAATTCGTGACTACATGTCGCAAATGATTTTCAAAATTGAAAATCCT 

GCAACGGATTGTCCGAAATCTCTACACCAAGTATGATAAGGAATTCTACTCCTATCCAGCTGCCAAGACCAACCA 

CCATGCCTTTGAAACGGGCITGGCCTATCATACGGCGACCATGGTGCGTTTGGCAGACGCTATTAGCGAAGTTTAT 

CCTCAGCTCAATAAGAGCCTGCTCTATGCGGGGATTATGTTGCATGACTTAGCTAAGGTCATCGAGTTGACGGGGC 

CAGACCAGACAGAGTACACAGTGCGAGGTAATCTTCTTGGACATATCGCTCTCATTGATAGCGAAATTACCAAGA 

CAGTTATGGAACTCGGCATCGATGATACCAAGGAAGAAGTCGTTTTGCTTCGTCATGTCATCCTCAGTCACCACGG 

CTTGCTTGAGTATGGAAGCCCAGTCCGTCCACGCATTATGGAAGCAGAGATTATCCATATGATTGACAATCTGGAT 

GCAAGCATGATGATGATGTCAACAGCTCTTGCTTTGGTGGATAAAGGAGAGATGACCAATAAAATCTTCGCTATG 

GATAATCGTTCCTTCTATAAACCAGATTTAGATTAA 

4166.2 

ATGAGTGAAAAAGCTAAAAAAGGGTTTAAGATGCCTTCATC^ 

CAGTGCTAACrTGGTTTATCCCTGCGGGGGCCTTTATAGAAGGTATTTACGAGACTCAGCCTCAAAATCCACAAGG 

GATTTGGGATGTCCTCATGGCACCGATTCGGGCTATGCTAGGTACTCATCCAGAGGAAGGTTCGCTCATTAAAGAA 

ACGAGCGCAGCG ATTG ATGTAGCC I ' TC IH'CATCCTTATGGTTGGTGGTTTCCTTGGCATTGTCAACAAAACTGGTG 

CTCTTGACGTAGGGATTGCCTCTATCGTGAAGAAGTATAAGGGCCGCGAAAAAATGTTAATTTTGGTACTGATGCC 

TTTGTTTGCCCTCGGTGGTACAACHTATGGTATGGGTGAAGAAACAATGG 

ATGATGGCCGTTGGTTTTGATAGCCTGACTGGTGTTGCAATTA 

TACTCTGAATCCATTTGCGACAGGTATTGCTTCAGCGACTGCGGGAGTTGGTACAGGGGACGGTATCGTACTTCGT 
CTGATCTTCTGGGTTACCITGACTGCTCTTAGTACTTGGTTTGTTTACCGTTATGCGGATAA 

GACTAAGTCACTGGTTTATAGTACTCGCAAAGAAGATTTGAAACACTTTAACGTAGAAGAATCTTCATCTGTAGAA 
TCTACACTTAGCAGCAAACAAAAATCAGTTCTCITCTTATTTGTGTTGACATTCATCTTGA 
TCCATGGACAGACCTTGGCGTTACCATTTTrGATGACTTTAATACTTGGTTGACTGGTCTTCCAGTTATTGOT 
TTGTCGGTTCATCTACTTCTGCACTAGGTACT^^ 

CTGATTGGTGTTATTTATGGTCTTAAAGAAGATAAGATTATCTCTTCCrrTCATGAATGGTGCTGCT 

TGTTGCCTTGATCGTAGCGATTGCTCGTGGTATTCAAGTTATCATGAACGACGGTATGATTACCGATACAATCCTC 

AACTGGGGTAAAGAAGGCTTGAGCGGTCTATCTTCACAAGTCTTTATCGTTGTA 

GTCATTCTTGATCCCATCTTCATCTGGTCTTGCCAGCGCAACTATGGGTATCATGGC 

TCCACTTGGAGAATTTGTAAATGTCCGTCCTAGCTTGATTATCACTGCTTACCAATCTGCTTCAGGTG 

TGATTGCACCAACATCTGGTATTGTGATGGGAGCTCTTGCACTTGGACGTATCAACATTGGTACTTGGTGGAAATT 

CATGGGCAAACTCGTAGTCGCTATTATTGTAGTGACCATCGCCCTTCTTCTCCTTGGAACCTTCCTTCCATT 

AA 

4166.3 

ATGAAAATAGATATAACAAATCAAGTTAAAGATGAATTTCHTATATCATT 

TACTCAATGAAGGAGAAAATGGAACACCTTTTGGACAAGCAATCCAAGATGTCCTAGAAAAAACTTTAGAG 
GTCGAGACATAGGTTTCACTACCTATCTTGACCCTAAAGGTTATTACGGATATGCAGAAATCGGTCAGGGAGCAG
AGCTTCTGGCCATTCTCTGTCATTTGGATGTTGTTCCATCAGGTGATGAj\GCAGATTGGCAGACACCGCCATTTGA
AGCAACTATCAAAGACGGCTGGGTATTCGGACGTGGTGTCCAAGATGATAAAGGCCCTTCGCTCGCAGCTCTCTA 
TGCAGTAAAAAGCTTGCTGGACCAAGGTATTCAGTTCAAAAAGCGCGTACGCTTTATCTTTGGTACCGATGAGGA
AACCCTCTGGCGCTGCATGGCACGCTACAATACCATCGAAGAACAGGCCAGTATGGGCTTTGCACCTGACTCATC 
TTTTCCTCTGACCTATGCTGAAAAAGGGCTTCTACAGGTCAAA

GAAGTAGGAGGCGCCTTTAACGTTGTACCAGACAAGGCCAACTACCAAGGTCTCCTCTATGAACAGGTTTGTAAC 
GGTCTCAAAGAAGCTGGTTATGATTACCAAACCACTGAACAAACCGTAACGGTTCTCGGAGTGCCAAAGCATGCT
AAGGATGCTAGTCAAGGTATCAATGCTGTCATCCGACTAGCTACCATTCTTGCrCCTCTCCAAGAACACCCTG 
TCAG 1T1 lCri GCAACACAAGCAGGTCAAGACGGCACAGGAAGACAAATCnTTGGTGATATAGCAGATGAACCTT 

CTGGTCACCTATCCTTTAATGTCGCAGGTCTCATGATCAATCATGAACGTTCTGAAATCCGTATTGACATTCGGAC
TCCTGTCTTAGCTGACAAGGAAGAACTAGTAGAGTTGCTTACAAGATGTGCACAAAACTACCAACTCCGCTACGA 
AGAGTTTGACTATCTAGCGCCTCTATACGTCGCAGAAGACAGTAAACTCGTTAGCACACTGATGCAAATCTACCA 
AGAAAAGACTGGCGATAACAGTCCTGCTATTTCATCCGGTGGTGCCACTTTTGCTCGCACCATGCCAAATTGTGTA 
GCCTTCGGCGCCTTATTCCCAGGAGCGAAGCAGACAGAACATCAGGCAAATGAATGTGCCGTTCTAGAAGATTTG 

TACCGTGCTATGGATATTTATGCCGAAGCCGTCTATCGACTTGCAACTTAA

4169.1 

ATGTCTAATTCATTTGTCAAGTTGTTAGTCTCTCAATTAT^ 

CATTGCTAACATATACATTATTTCAAAATCAGTAATTGCCACATCACTAGTTCC TATC TT AATA GGAATATCCTC^ 
TTGTTGCGAGTCTTTTAGTTCCGTTGGTTACTAAAA

AAGACTATATTATTGGCGATACTGGTAGGAATGTTTACCGTAATGCAATCCGTAGCGCCTTTGGTGACCTATCTAT 

TTGTTGTTGCAATTTCCATACTAGATGGTTTTGCAGCACC 

rTGGGTAAGGCTAATTCAGCCTTATCAATGACTGGTGAAGCTGTTCAATTGATAGGTTGGGGA TTAG GTGGACTCT 
TGTTTGCAACAATTGGTCTGTTACCTACCACGTGTATCAATTTAGTCTTGTA TATCA TTTCT 
14 ' l ' L ' 14 CCTAACGCTGAAGTGGAGGTGTTAGAGTCAGAAACTAATCTrGA AATTTTG CTCAj\A
TTGCTAGAAATCCTAGATTAAGACTTTTTGTATCAGCAAATTTATTGGAAATTT^
TCCATTATACTTGTTTTTGTAACGGAGTTATTAAATAAAACGGAAAGTTACTGGGGATATTCTAATA 
CTATTGGTATTATAATTAGTGGCTTAATTGCTTTTAGGCTATCTGAAAAGTTCCTTG 

ATTATTCACCCCAAATCTAj\AAACCATCCAGAATCCTTGCCTTAGCTTAGATCCTGGATGGl MCI 11 1 1 1CACCCA 
ATGGGTGTTTTTTACTAGACAAAAAAGAGTITCCCCTTTATGG

AAACTCACATGAACAGTTTACCAAATCATCACTTCCAAAACAAGTCTTTTTACCAACTAT 

TTTAACCCAGTATGGTGGTCTTATC V 1"! ' I'l 1CAGGAACTIT1T1 CCCAGTTGAAACTAAAAGAGCGG ATTT CTAAGT 
ATTTAGTAACGAATGACCAACGCCGCTACTGTCGTTATTCGGATTCAGATATCCTTGTCCAGTTCCTCTTTC 
TTAACAGGTTATGGAACGGACTATGCTTGTAAAGAATTGTCAGCTGATGCCTACTT 
TCCAAAATTGTTGGAAGGAGGGCAGCTTGCTTCACAGCCAACCTTATCCCG TTTI CI T TCCAG AACTGACGAGGAA

ACAGTCCATAGTTTGCGATGCCTCAACCTTGAATTGGTCGAATTC^ 

TTGTAGATATCGATTCTACCCATTTCACAACTTATGGCAAGCAAGAAGGTGTTGCTTATAACGCCCACTATCGTGC 
TCATGGCTATCATCCTCTTTATGCTTTCGAGGGGAAGACAGGTTATTGTTTCAATGCCCAGCTT CGTC CTGGTAATC 
GTTATTGTTCTGAAGAGGCAGACAGCTTTATCACACCTGTTTTAGAACGGTTTAATCAACTT 

TAGTGGCTirGCGACCCCAAAATTATACGATTTAATTGAAAAAACAGGGCAATACTACCTCATAAAACT

AAATACTGTTCTGAGCCGTCTTGGAGACCTTTCCCTCCCTTGCCCACAGGATGAGGACnTAACCATCTTGCCCCAC 
TCCGCCTACTCAGAAACTCTCTATCAAGCAGGATCTTGGTCGCACAAGCGTCGTGTCTGCCAGTTCTCTGAACGAA 
AAGAAGGAAACTTGTTCTACGATGTTATTTCTCTCGTTACAAATATGACGAGTGG AACAAGC CAAGACCAGTTTCA 
GCTTTATCGTGGACGTGGTCAAGCCGAGAATTTCATCAAGGAGATGAAGGAGGGATTTTTTGG CGATAA AACGGA 

TAGTTCAACCTTAATCAAAAACGAAGTTCGTATGATGATGAGCTGTATCGCCTACAATCTCTATCTTTTTCTCAAA
CATCTAGCTGGAGGTGACTTCCAAACTTTAACAATCAAACGCTTCCGCCATC1 TTI 111 1 CA CGTGG TGGG AAAA T 
GTGTTCGAACAGGACGCAAGCAGCTCCTCAAATTGTCTAGTCTCTATGCCTATTCCGAATTGTTTTCAGC^ 
TTCTAGGATTAGAAAAGTCAACCTGAATCTTCCTGTTCCTTATGAACCACCTAGAAGAAAAG 
CATTAA 


4169.3 

ATGATGGAGTTTTTTCAACAGCTTCCTCATTTAGAGCCATATC 

AACCTTGCCC ATCTTT AT AGGTCTC ITTT 1'C AAG AAACGCTTTGCCTGGT ATG AAGTGTTGGT AAGTCTCI' 1 C 1" l'l A 
TTGTCACCATGTTGGTGGGTGGAAAGACCAATCAACTAGCTGCCTTGGGTATTTACCTTTGCTGGGAAA 
CCTGCmTCTACAAGCATTATCGAAAAAGCAAGGATGGCAAGTGGGTCITCTAC^
CTTCCGATTATCTTTGTCAAGGTGCAACCAGCTATCAATGGAACGCAG 

CCTGACCTTTCGTTCGGTTGGAATTGTCATCGAGCTGAGAGATGGAGTGATTAAGGATTTTACCCTCTGGGAAT^ 
CTCCGTTTCCTTCTCTTCATGCCAACTTTCnX^GAGTGGTCCAA 

GGCTATTCCTGAGCGAGATGAGTTGATGGATATGCTGGATGAATCTGTCCGCTATATCATGTGGGGCTTTTTGTAT 
AAGTTTATCCTAGCTCATGTTTTAGGAGAGACCTTACTACCTCCTCTGAAGAATTTAGCCTTG
TCTTTAATCTCTATGCCTTGGCAGTTATGTATACll'riGGTCTGGAACTCri'ClU'l 
TTGCTTTGGCCATCTCAAACTrGATGGGAATCCGTAGCCCTATCAACTTTAACAAGCCCTTTTr 
AAAGGAGTTITGGAATCGCTGGCATATGAGTCTGTCCTTCTGGTT^^ 

GTGTTAACCAGAAAGAAAGTCTTTAAAAATCGTAATGTAACCTCAAGCATGGCCTACATTGTAAATA 
TGGGATTTTGGCATGGTGTGACCTGGTACTATATCGCCTATGGACTCTTTCATGGACTAGGCTTGGTCATC
TGCCTGGGTTCGCAAGAAAAAAACGCTCAATAAGGAACGGAAAAAAGCAGGGAAGGCTGCCCTACCTGAGAATC

GCTGGATTCAGTTGCTTGGCATGGTTGTCACTTTCCATGTTGTCATGTTGTC 

ATAATCTATGGTTTAAAAAATAA 

4169.4

ATGCTTAAACGCrrTATGGATGATCTTCGGACCGGTCTTGATCGCTGGTTTGTTGGTTTTTCT 

CCTACTGAGATGCATCATAATCTAGGAGCTGAAAAGCGTTCAGCAGTGGCTACTACTATCGATAGTTTTAAGGAGC 
O^TCAAAAAGTCAGAGCACTATCTGATCCAAATGTGCGTTTTGTTCCCTTCT^ 
1A TTTTGACGGTGCTCATCCTGCGGTATTAGCTGAGAAATACAATCGTTCCTACCGTCCTTATCT^ 
GGAGCTGCATCGCTTAACCA^TATTTTGGAATGCAACAGATGTTACCACAGCTGGAGAATAAACAAGTTGTGTAT
GTTATCTCACCTCAGTGGTTCAGTAAAAATGGCTATGATCCAGCAGCCTTCCAGCAGTATTTTAATGGAGACCAGT
TGACTAGTTTTCTGAAACATCAATCTGGGGATCAGGCTAGTCAATATGCAGCGACTCGCTTACTGCAACAGTTCCC 
AAACGTAGCTATGAAGGACCTGGTTCAGAAGTTGGCAAGTAAAGAAGAATTGTCGACAGCAGACAATGAAATGAT
TGAATTATTGGCTCGTTTTAArGAACGCCAAGCTTC
ATAAGCATGTAGCTAAGTATTrAAAAATCTTGCCAGACCACriTTTCITATCAGGC^

AGATGCTGAAAAAAATACTTCCAATAATGAGATGGGAATGGAAAATTATTTCTATAATGAGCAGATCAAGAAGG^ 
TTTGAAGAA ATTA AAGGATTCTCAGAAAAGCTTTACCTATCTCAAGTCGCCAGAGTATAATGACTT 
TTAACACAGTTTTCTAAATCTAAGGTAAACCCGATTTTTATC 

CTGGTCTACG AGAGG ATATGTACCAACAAACGGTGCAGAAGATTCGCTACCAGTTAGAAAGTCAAGGTTTTACCA 
ATATA GCAGA run CTAAGGACGGCGGGG AGCC IT 1 C IT 1ATG AAGG ACACCATTCACCTTGGTTGGTTGGGTTG 
GTT^CTTTTGACAAGGCAGTTGATCCITTCCTATCCAATCCCACACCAGCTC
TTTTTCAGCAAAGATTGGGCGACTTATGATGGAGATGTCAAAGAATTTCAATAG 






4169.6

ATGGAGAAAAACCTCAAGGCTTTGAAACAAACAACAGACCAAGAAGGCCCAGCAATTGAACCTGAAAAGGCAGA

GGATACCAAGACAGTCCAA AATGG TTACTTCGAGGATGCAGCTGTCAAGGACCGCACCTTGAGTGACTATGCAGG 
TAACTGGCAATCAGTTTATCCHTTCCTrGAAGACGGCACGTTTGACCAAGTCTTTGACT 

GGTAAGATGACCCAGGCTGAGTACAAGGCTTACTATACAAAAGGCTATCATACAGATGTGACTAAGATTAACATT 
ACTGATAATACTATGGAATTTGTTCAAGGTGGACAAAGCAAGAAATACACTTACAAGTATGTCGGTAAGAAAATT 
TTGACTTACAAGAAAGGCAATCGTGGCGTGCGTTTCCTCTTTGAAGCCACAGATGCTGACGCTGGACAATTCAAGT 
ATGTTCAGT TTAG TGACCACAjVTGTrGCCCCAGTTAAGGCAGAACATTTCCATATCTTCTTTGGAGGCACAAGCCA 
AGAAGCCCTCTTTGAAGAAATGGACAACTGGCCAACCTACTACCCAGATAACCTATCTGGCCAAGAAATCGCCCA 
AGAAATGTTGGCGCATTGA 

4170.3

ATGAAAGATGGTCAJTTGCTAGCCCATCATATTCGTT^ 

ATCCTGAGGCTCTTTATAGGGGTGAACAGGGCAAGATTTTAGCGGTTTTATGGAATAGTGAAACTGGCTGCGCAA 
CTGCGACAGATATCGCGCTTGCGACTGGACTTGCGAATAATACGCTGACGACTATGATAAAAAAGCTAGAGGAAC 
AAAAGCTTGTAATTGTTAGTCCGTGTGGAAAAGACAAGCGTAAGAAGTATTTAGTTTTAACGGAGTTAGGCAAGT 
CCCAGAAAGAAGTGGGGCATCGTGTCAGTCAGAAATTGGATACTATCnTTTACAAAGGA
TrCACCAATTTGAAGGTTTTCAAGAAAGAATTTTGGCGAATCTGAAAGAGAAGGG 

4170.4 

A< A TGACT AATTTAATTGCAAet riT^GATCGTrTTAGTCATTGCTTGACAGCTCTATCT^ 

*D Gci I L rGACCTTGTTACTAGCTATTTTGCTTGCGATTCCCTTGGCTG I 1 1'l'I cn'CGCTATCATGAGAAGCTGGCCG 

ACTGGGTCTTGCAGATTGC AGGT ATTTTCCAGACCATCCCGTCTCTGGCCITGTTGGGGCrCTTTATCCCTTTGA 
GGAATTGCK3ACCrTGCCGGCTTTGACAGCTCTAGTGATTTATGCGATTTTCCCT 

GGCTGAAGGGAATTGATCCGAACCTGCAAGAGGCTGGGATTGCCITTGGGATGACCAGATGGGAACGTCTCAAGA 
AATTTGAAATTCCACrCGCCATGCCTGTTATCATGTCTGGGATTCGGACGGCAGCTGTTTTGATTATCGGTACGGC 
AACCTTGGCGGCCTTGATTCCTGCAGGGGGACrAGGTTCCm 

TTGATTTTGATTGGG GCACT TTCIT CTGCA GTGCTAGCCATTGCCTTTAACTTCCTACT 
CAAAATTACGGACGATTTTCTCAGGTTTTGCCTTGGTGGCTTTATTACTGGGTCTGTCTTATAGTCCAGCT 
GTTCAAAAAGAGAAGGAAAACTTGGTTATTGCTGGGAAAATAGGTCCAGAACCAGAAATTTTGGCCAATATGTAT 
AAGTTGCTGATTGAAGAAAATACCAGCATGACTGCGACTGT^ 

^GCrCTGAAAAAAGGCGATATrGACATCTATCCTGAATTTACrGGTACGGTGACrGAAAGTTTGCTTCAAC 
ACCCAAGGTGAGTCATGAACCAGAACAGGTTTATCAGGTGGCGCGTGATGGCATTGCTAAGCAGGATCATCTAGC 
CT ATCTCAAACCCATGTCTTATCAAAACACCTATGCrGTAGCTGTTCCGAAAAAGATTGCTCAAGAATATGGCTTG 
AAGACCATTTCAGACrTGAAAAAJ^GTGGAAGGGCAGTTGAAGGCAGGTTTTACACTCGAGTTTAACGACCGTGAA 
GATGGAAATAAGGGCTTGCAATCAATGTATGGTCTCAATCTCAATGTAGCGACCATTGAGCCAGCCCTTCGCTATC 
AGGCTATTCAGTCAGGGGATATTCAAATCACGGATGCCTATTCGACTGATGCGGAATTGGAGCGTTATGATTTACA
GGTCTTGGAAGATGACAAGCAACTCTTCCCACCTTATCAAGGGGCTCCACTCATGAAAGAAGCTCTTCTCAAGAA 
ACACCCAGAGTTGGAAAGAGTTCTTAATACATTGGCTGGTAAGATTACAGAAAGCCAGATGAGCCAGCTCAACTA 
CCAAGTCGGTGTTGAAGGCAAGTCAGCAAAGCAAGTAGCCAAGGAGTTTCTCCAAGAACAAGGTTTGTTGAAGAA

4170.5

ATGATGCATACTTATTTGCAAAAGAAAATTGAAAA^ 

CGTATGGTTGCGGCTATGGCTGATTTAGGATTTTCAGGAACr^^ 

GTAGTTTTGCCCAGTGGATTTATTTGCTGGTTTTAGGAAGTTTTCCTCTCTGG 

ATTGTTGACTGGATTGGGATGATTTGTAGCTTG AC AG GG ATT ATCTGTGTAATCTTTGTATCGG 

GTAATTATCTTTrTGGCTTGATTAACTCTGTTATTTACCTTATTTTGGCCCT 

CTGACGACACTTTACTTCACAGTCATGCAGCCAATTGGACTTCTAGTT^ 

AAAAGCAGGAGTTTGTCGCGCGTAAACTGGACGGCAAGGGCTGGACAAAGTATCTTTCCATTAGTGTGCTTTGGT 

GGTTGGCCTTTGGCTTCATTTATCAGTCTATTGGTGCCAATCGTCCCTATCGTGATTCAATCACAGATGC AACCA A 

TGGGGTAGGGCAAATCCTCATGACAGCTGTTTACCGTGAACAGTGGATATTCTGGGCGGCTACCAATGTCTTTTCA 

ATCTATCTCTGGTGGGGAGAAAGCCTGCAAATTCAAGGGAAATATCTAATTTATCTCATTAACAGTCTAGTTGGTT 

GGTATCAATGGAGCAAGGCAGCTAAGCAGAATACTGATTTACTTAACTAG 

4170.6 

ATGAGAAATATGAAGGCAAAATATGCTGTTTGGGTGGC1 11" riU'Cl'l AAATrTGACTTATGCCATTGTTGAGTTTAT 

TGCAGGTGGAGTATTTGGTTCTAGCGCTGTTCTTGCTGACTCTGTGCATGACTTGGGAGATGCGATTGCAATTGGA 

ATATCAGCTITTCTAGAAACAATCTCCAATCGTGAAGAAGACAATCAGTACAC CrrTGG GCTATAAGCGGTTTAGCC 

TGCTAGGAGCCITGGTAACAGCTGTGATTCTCGTAACGGGCTCTGTTCTAGTCATTTTGGAAAATGTCACGAAGAT 

TTTGCATCCGCAACCAGTCAATGATGAGGGGATTCTCTGGTTAGGAATTATTGCGATTACTATCAATCTGTTAGCG 

AGTCTGGTGGTTGGTAAGGGAAAGACAAAGAATGAGTCTATTCTGAGTCTGCATTTTCTGGA AGATA CGCTAGGG 

TGGGTAGCTGTTATCCrGATGGCGATTGTTCTTCGATTTACGGACTGGTAT 

C A rTTCTTTCTT 1 ' ATTCTTTC AAA AGCCCTTCC ACG [TTn GGTCT AC ACTC AAG A 1 ' 1 1 111 1 GGATGCTGTGCCAG 

AAGGTCTTGATATCAAGCAAGTAAAGAGTGGCCTGGAGCGATTGGACAATGTGGCCAGCCTTAATCAGCTTAATC 

TCrGGACTATGGATGCrTTGGAAAAAAATGCCATTGTCCATGTTTGTCrAAAAGAAATGGAACATAT 

TAAAGAGTCTATTCGAATTTTCCTAAAAGATTGTGGTTTTCAAAATATTACCATTGAA^ 

ACTCACCAAACCCATAAGCGAAAGGTGTGTGACTTGGAACGGAGTTATGAGCATCAACATTAG 

4170.8 

ATGATTGAATACAAAAATGTAGCACTGCGCTACACAGAAAAGGATGTCTTGAGAGATGTCAACTTACAGAT TGAG 

GATGGGGAATTTATGGTTTTAGTAGGGCCTTCTGGGTCAGGTAAGACGACCATGCTCAAGATGATTAACCGTCT 

TGGAACCAACTGATGGAAATATTTATATGGATGGGAAGCGCATCAAAGACTATGATGAGCGTGAACTTCGTCTTT 

CTACTGGTTATGTTTTACAGGCTATTGCT C r i l l I CCAAATCT AACAGTTGCGG AAAAT ATTGCTCTCATTCCTG AA 

ATGAAGGGGTGGAGCAAGGAAGAAATTACGAAGAAAACAGAAGAGCnTTTGGCTAAGGTTGGTTTACCAGTAGCC 

GAGTATGGGCATCGCTTACCTAGTGAATTATCTGGTGGAGAACAGCAACGGGTCGGTATTGTCCGAGCTATGATTG 

GTCAGCCCAAGATTTTCCTCATGGATGAACCCITITCGGCCTrGGATG 

GACAAAAGAATTGCATAAAGAGTTTGGGATGACAACGATTTTTGTAACCCATGATACGGATG AAGC GTTGAAGTT 
GGCGGACCGTATTGCTGTCrTGCAGGATGGAGAAATTCGCCAGGTAGCGAATCCCGAGACAATTTTAAAAGCGCC 
TGCAACAGACTTTGTAGCAGACTTGTTTGGAGGTAGTGTTCATGACTAA 

4171.1 

ATGTCAGCAGTrGCrATTTCAGCTATGACCAAGGTTATGCAAGAAACCCACGGAAATCCrrCTAGTATTCAT 

ATGGTCGTCAAGCTGGCAAACTCTTGCGAGAAGCCCGTCAGGAACTAGCCCAGTTACTAAGGACAAAACCTCAAC 

ATAT C r iTn CACrrCTGGTGGGACTGAAGGCAATAATACTACCATC 

AGGAAAACATATCATCACAACTGCCATCGAGCACCATGCTGTCCrrGAAACAATTGATTACTTGG TTCA A 

GGGTTTGAAGCAACCATTATCCAGCCAGAAAATCAAGAAATCACAGCCCAGCAAATTCAAAAGGCnTTACGTGAC 

GATACGATTTTGGTTTCTACCATGTTTGTCAATAATGAGACAGGAAACCT^ 

TACrCAAGCAACACCCTGCTGCCTATCATGTTGATGCAGTTCAGGCTATTGGTAAAATCCCAATTCATTCAGAAGA 

ATTGGGCATTGATTTTCTCACTGCTTCTGCCCACAAATTCCATGGTCCTAAGGGAATCGGTTTTCT 

GCATGGACTTTGATTCCTATCTACATGGCGGAGACCAGGAACAGAAAAAACGTGCAGGAACTGAAAATCTGCCTG 

CCATTGTAGGCATGGTTGCAGCCCTAAAAGAAGACCTAGAAAAACAAGAAGAACATTTTCAACATGTACAAAATC 

TAGAAACTGCCTTTCTGGCAGAGCTGGAGGGCATTCAGTATTACCTGAATAGAGGAAAACATCATCTC 

TCTCAATATTGGATTTCCTGGTCAGAAAAATGACCTCTT'ACTCCTTCGGCTAGATTT 

CTGGCTCAGCCTGTACTGCAGGCGTTGTCCAATCCAGCCATGTTCITGAAGCCATGTATGGCGCAAATTCAGAACG 

CTTGAAGGAATCCCTTCGCATCAGTTTGTCGCCACAAAATACCGT^^ 

AAAGAAATTATCGGAGGTTAG 



4172.1 

ATGTTATTCAAATTATCTAAGGAAAAAATAGAGCTAGGCn*ATCTCGTTTATCGCCAGCCCGTCGTA 1 1 I 1 1 1 1GA 
GTTTTGCCTTGGTCATTTTACTAGGCTCTCT 1 C I 114GAGOTGCCCTTTGTCCAAGTTGAAAGCTCACGAGCGACT 
TATTTTGATCATCrTTTCACrGCTGTCrcrGCAGTCTGTGTGACGGGTCTCTCAACCCrTCCA 
TAATATCTGGGGTCAAATAATCTGTTTGCTCTTC 

TCTATATCCAGAGCAAGCAAAAGCTTAGTCITCGTAGCCGTGCAACTATTCAGGATAGTTTTAGTTATGGAGAAAC
TCG^CTTTGAGAAAGTTTGTCTATT^
GTTTTCGC CTTAT TCCTCAACrTGGCTGGGGACGT 

AATGCCGGTTTTG ATAATTTAGGG AGCACCAGTTT ATTTG C 1 TI 1 1 CAG ACCG ATTTACTGGTCAATCTGGTG ATTCC 
AGGCITGATTATTACAGGCGGCCTTGGTTTTATGGTCTGGTTTGATTTGGCTGGTCAT^ 
GGAC GTCTGCACTTTCATACGAAGCTrGTACTATTATTGACTATAGGTTTGTTGTTATTTGGAACA 
CTTTCTTGAGTGGAACAATGCTGGAACGATTGGCAATCTCCCTGTTC 

ACAGTGACGATGCGAACAGCTGGCTTTTCTACGATAGATTATACTCAGGCTCATCCTGTGACTCriUUGATTTATA 
TCT TACA GATGTTTCTAGGTGGGGCACCTGGAGGAACAGCTGGGGGACTCAAGATTACGACArrrrTTGTCCTCTT 
GGTCTTTG CACG AAGTGAGCTTCTAGGCTTGCCTCATGCCAATGTTGCGAGACGAACGATCGCGCCGCGAACGGTT 
CAAAAATCCTTT AGTG TCnTTATTATCTTTTTGA 

AGGCAATCCTCCCTTTATCCACCTCGTATTTGAAACCATTTCAGCTCTTAGTACAGTTGGTGTAACGGCAAATCTG 
ACTCCTGACCTTGGGAAATTGGCTCTCAGTGTTATCATGCCACTTATGTTTATGGG 

ACGAmATTGGTCCCTTGACCTTGTTTGTTAGCTTGGCAGATTACCATCCAGAAAAGAAAGATATGATTCACTATATG 
AAAGCAGATATTAGTATTGGTTAA 

4172.2 

ATGTCAGATCGTACGATTGGA^TTTTGGGCrTGGGAAl-ri'riGGGAGCAGTGTCCTAGCTGCCCTAGCCAAGCAGG 

ATATGAATATTATCGCTATTGATGACCACGCAGAGCGCATCAATCAGTTTGAGCCAGTTTTGGCGCGTGGAGTGAT 

TGGTGACATCACAGATGAAGAATTATTGAGATCAGCAGGGATTGATACCTGCGATACCGTTGTAGTCGCGACAGG 

TGAAAATCTGGAGTCGAGTGTGCTTGCGGTTATGCACTGTAAGAGTTTGGGGGTACCGACTGTTATTGCTAAGGTC 

AAAAGTCAGACCGCTAAGAAAGTGCTAGAAAAGATTGGAGCTGACTGGGTTATCTCGCCAGAGTATGAAATGGGG 

CAGTCTCTAGCACAGACCATTCTTTTCCATAATAGTGTTGATGTCTTTCAGTTGGATAAAAATGTGTCTATCGTG 

AGAT GAAAATTCCTCAGTCTTGGGCAGGTCAAAGTCTGAGTAAATTAGACCTCCGTGGCAAATACAATCTGAATA 

TTTTG GGTT TCCGAGAGCAGGAAAATTCCCCATTGGATGTTGAATTTGGACCAGATGACCTCTTGAAAGCAGATAC 

CTATATTTTGGCAGTCATCAACAACCAGTATTTGGATACCCTAGTAGCATTGAATTCGTAA 

4172.3 

ATGAAGTTATTGTCTATCGCA ATTTC TAGCTATAATGCAGCAGCCTATCTTCATTACTGTGTGGAGTCGCTAGTGA 
TTGGTGGTGAGCAAGTTGGGATTTTGATTATCAATGACGGGTCTCAGGATCAGACTCAGGAAj\TCGCTGAGTGTTT 
AGCTAGCAAGTATCCTAATATCGTTAGAGCCATCTATCAGGAAAATAAATGCCATGGCGGTGCGGTCAATCGTGG 
CTTGGTAGAGGCTTCTGGGCGCTATTTTAAAGTAGTTGACAGTGATGACT 

ATTCTTGAAACCTTGCAGGAACTTGAGAGCAAAGGTCAAGAGGTGGATGTCTTTGTGACCAATTTTGTCT 
AGGA^GGGCAGTCTCGTAAGAJVGAGTATGAGTTACGATTCAGTCnTGCCTGTTCGGCAGATTTTTGGCTGGGACCA 
GGTCGGAAATTTCTCCAAAGGCCAGTATACCATGATGCACTCGCTGATTTATCGGACAGATTTGTTGCGTGCTAGC 
CAGTTCTAA 

4172.4 

ATGAAATTCAATC CAAA TCAAAGATATACTCGTTGGTCTATTCGCCGTCTCAGTGTCGGTGTTGCCTCAGTTGTTG 

TGGCTAGTGGCTTCTTTGTCCTAGTTGGTCAGCCAAGTTCTGTACGTGCCGATGGGCTCAATCCAACCCCAGGTCA 

AGTCTTACCTGAAGAGACATCGGGAACGAAAGAGGGTGACTTATCAGAAAAACCAGGAGACACCGTTCTCACTCA 

AGCGAAACCTGAGGGCGTTACTGGAAATACGAATTCACTTCCGACACCTACAGAAAGAACTGAAGTGAGCGAGGA 

AACAAGCCCTTCTAGTCTGGATACACTTTTTGAAAAAGATGAAGAAGCTCAAAAAAATCCAGAGCT 

CTTAAAAGAAACTGTAGATACAGCTGATGTGGATGGGACACAAGCAAGTCCAGCAGAAACTACTCCTGAACAAGT 

AAAAG GTGG AGTGAAAGAAAATACAAAAGACAGCATCGATGTTCCTGCTGCTTATCTTGAAAAAGCTGAAGGGAA 

AGGTCCTTTCACTGCCGGTGTAj\ACCAAGTAATTCCHTATGAACT^ 

TTACTAAAAGCTTCGGATAATGCTCCTTGGTCTGACAATGGTACTGCTAAAAATCCTGCTTTACCTCCT 

GATTAACAAAAGGGAAATACITCTATGAAGTAGACTTAAATGG 

TTGATCAACTTCGCGCTAATGGTACTCAAACITATAAAGCT 

TGACTTGACTAATCrAGTTGCTACTAAAAATGTAGACATCAACATCAATGGATTAGTTGCTAAAGAAA 

AAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTGAAGGT 

CCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGCATGTTGACTCGTCTCTTGC 

TCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGCGACGCTAAAAACCCAGCCCTATCTCCACTAGGCGAAA 

ACGTGAAGACCAAAGGTCAATACTTCTATCAAGTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACAAGCGC 

TCATTGACCAGTTCCGAGCAAATGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGACGGTA 

AACCAGACTTGGACAACATCGTAGCAACrAAAAAAGTCACTATTAACATAAACGGTTTAATTrCTA 

TTCAAAAAGCCGTrGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTG 

AAGGTCCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGTATGTTGACTCGTCT 

CTTGCTCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGTGACGCTAAAAACCCAGCCCTATCTCCACTAGG 

TGAAAACGTGAAGACCAAAGGTCAATACTTCTATCAATTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACA 

AGCGCTCATTGACCAGTTCCGAGCAAACGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGA 

CGGTAAACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGT^ 

AACAGTTCAAAAAGCCGTTGCAGACAACGTTAAGACAGTATCGATGTTCCAGCAGCCTACCTAG 

4172.5

ATGAAACTAAAAAGTTATATTTTGGTTGGATATATTATTTC^
TCAAAAAATGCTGATTGCGAAAGGCGAGATTTACTTTTTGCTTGGGATGACCA 

GGGATTAGTCTCTTTCTCCTATTGCCAGTCTTTACGTCGTTGGGCAAACTCAAGGAGCATGCCAAGCGGGTAGCGG 

CCAAGGATTTrCCTTCAAATTTGGAGGTrCAAGGTCCTGTAGAATTTCAGCAATTAGGGCA^ 

GTCCCATGATTTGCAGGTAAGCTTTGATTCCTrGGAAGAAAGCGAACGAGAAAAGGGCTTGATGATTGCCCAGTT 

GTCGCATGATATTAAGACTCCTATCACTTCGATCCAAGCGACGGTAGAAGGGATTTTGGATGGGATTATCAAGGA 

GTCGGAGCAAGCTCATTATCTAGCAACCATTGGACGCCAGACGGAGAGGCTCAATAAACTGGTTGAGGAGTTGAA 

TTTTTTGACCCTAAACACAGCTAGAAATCAGGTGGAAACTACCAGTAAAGACAGTATTTTTCT 

ATTGAGTGCATGAGTGAATTTCAGTTTTTGATTGAGCAGGAGAGAAGAGATGTCCACTTGCAGGTAATCCCAGAGT 

CTGCCCGGATTGAGGGAGATTATGCrA^GCrTTCTCGTATCTTGGTGAATCTGGTCGATAACGC ITITAAATATTC 

TGCTCCAGGAACCAAGCTGGAAGTGGTGGCTAAGCTGGAGAAGGACCAGCTTTCAATCAGTGTGACCGATGAAGG 

GCAGGGTATTGCCCCAGAGGATTTGGAjWlTATTTTCAAACGCCITTATCGTGTCGAAACTTCGCG 

acaggtggtcatggattaggacttgcgattgcgcgtgaattggcccatcaattgggtggggaaatcacagtcagc 
agccagtacggtctaggaagtacctttaccctcgttctcaacctctctggtagtgaaaataaagcctaa 

4172.6 

atgtttggtcaaacggctcaacatggtcttacgaatagcctgaaagacttctgga rrm ctgctgaatataggtc 

cacaattggcgtttttttgccagatgctccgctgttccagatcggttgagcagggtactggaaatcaccgtcgtga 

gttcaatatgattcagcagatattctcgcattttgggatgacrcacitgggacaaatcaagttggtcratcaagag 

tcgattgaccttgagttgctggtcaatgcacttaatcatcacttgctcattgacagactggtcctcacgcccaatc 

ajvataacgatagaaatcgacaggcagatagtacatggtcttgacctgctgaaggggcgtaaagacaaagagatta 

tcgacataaaaagtatgttcaggcagttagaactggctagcacgcaacaaatctgtccgataaatcagcgagtgc 

atcatggtatactggcctttggagaaatttccgacctggtcccagccaaaaatctgccgaacaggcaagactga 

4174.1 

atggaacatttagcaactta 1 i i i lcaacctatggaggagcrncl l'cgctgcattgggaattgtattggcggttg 
gattaagcggtatggggtctgcttatggagttggtaaggctgggcaatctgccgcagctttactgaaagaacagc 
ctgaaaagtttgcctcagctttgatattgca^ttattgccc 

tttaatttggtrgcaattaactccagaacttcctttagaaaaaggggttgctta i'l l c i ttgtagctcttccaattg 
ctattgtaggatacrri^cagcraagcatcaj\ggaaatgtagcagtagcgggaatgcaaa tcttg gctaaaagac 
caaaagaattcatgaagggagcaattttagctgccatggtagaaacctatgcaattcitgcrr^ 
cattttgacccttcgtgtataa 

4175.2 

ATGTTAAAATCAGAAAAACAATCACGTTATCAAATGTTAAATGAAGAATTGTCCTTCCTATTGGAAGGCGAAACC 

AATGTTTTGGCTAATCnTrCCAACGCCAGTGCrCTCATAAAATCACGTTTTCCTAATACCGTA 

TTTGTTCGATGGAAAGGAATTGGTTTTAGGCCCCTTCCAAGGAGGTGTTTCCTGCATCCGTATTGCACTAGG 

GGTGTTTGTGGTGAGGCAGCTCACTTTCAGGAAACTGTTATTGTTGGAGATGTGACGACCTATCTCAACTATA 

CTTGTGATAGTCTAGCTAAAAGTGAAATTGTGGTGCCGATGATGAAGAATGGTCAGTTACTTGGAGTTCTGGATCT 

GGATTCTTCAGAGATTGAGGATTACGATGCTATGGATCGAGATTATTTGGAACAAT^ 

AAGACAGCATGGGACTTTACGATGTTTGAGGAAAAATCTTAA 

4175.3 

ATGTCAGTATTAGAGATCAAAGATCTTCACGTTGAGATTGAAGGAAAAGAAATTTTA AAAG GGGTTAACCTGACC 
CrGAAAACAGGAGAAATTGCCGCTATCATGGGACCAAATGGTACAGGTAAATCGACrcrTTCTGCCGCTATCATG 
GGAAATCCAAACTATGAAGTAACTAAAGGTGAAGTTTTGTITGATGCCGT 

GAGCGTGCGCGTATGGGACTTTTCCTTGCTATGCAATACCCATCAGAAATCCCTGGAATTACCAATGCTGAGTTTC 

TTCGTGCCGCTATGAATGCGGGTA^VAGAAGATGATGAGAAGATTTCAGTTCGTGAGTTTATTACTAAGCTAGATGA 

AAAAATGGAATTGCTCAACATGAAAGAmAGAAATGGCAGAGCGTTACCTCAACGAAGGCTTCTCTGGTGGTGAGAA 

AAAACGCAATGAAATTCTTCAAC1 11 1 GATGTTGGAGCCAACATTTGCTCir ITGG A CG AG ATTGACTCAGGTCTT 

GATATTGACGCTCTTAAAGTTGTGTCTAAAGGTGTCAATGCCATGCGTGGTGAAGGTTTTGGTGCTATGATCATCA 

CTCACTACCAACGTCTTTTGAACTATATCACACCrGATGTGGTACACGTGATGATGGAAGGTCGTGTTGTCCnTC 

TGGTGGTCCAGAATTGGCTGCGCGTTTGGAACGTGAAGGATACGCAAAATTAGCTGAAGAACTTGGCTACGACTA 

CAAGGAAGAATTGTAA 

4174.4 

ATGCCCTACAAAAGACAAAGGAGTTTTTCAATGGCACTTTCTAAACTAGATAGCCrrTATATGGCAGTGGTAGCAG 
ACCATTCGAAAAATCCACATCACCAAGGGAAGTTAGAAGATGCTGAGCAAATCAGTCTCAACAATCCGACTTGTG 
GGGATGTCATCAACCTCrCTGTCAAGTTTGATGCAGAGGACCGTTTGGAAGATATTGCriUU'CTAAATTCAGGATG 
C ACG ATTTC AACTG CTTCTGCT AGTATG ATG AC AG ATGCCGTTTT AGG AAAAACC AAAC AAG AAATTTTAG AACTG 
GCGACTA n ri l'ICTGAAATGGTTCAAGGGCAAAAAGATGAGCGTCAAGACCAACTTGGAGACGCGGCATTCTTG 
TCAGGTGTTGCCAAATTCCCTCAAAGAATCAAGTGTGCAACCCTAGCrrGGAATGCCCITAAGAAAACAATTGAA 
AATC AAG AA A AAC AGTAA 

4175.5 




ATGAAAATTCAAGACCTATTGAGAAAAGATGTCATGTTGCTAGATTTGCAGGCAACTGAAAAAACAGCT 
GACGAGA TGAT TAAAAATTTGACAGACCACGGTTATGTAACAGATTTTGAAACATTTAAAGAAGG 
CGTGAAGCTTTGACTTCTACTGGTTTGGGTGATGGAATCGCAATGCCTCACAGCAAAAACGCTGCT 
GCGACAGTTCH\\TTTGCTAAGTCAAATAAGGGTGTTGACT^ 
TCATGATTGCAGCTCCAGAAGGTGCCAATGATACTCACTTGGCAGCCnTGGCAGAATTGTCTCAATACTTGATGAA
AGACGGTTTTGCAGACAAACTTCGTCAAGCAACATCTGCAGACCAAGTTATCGAACH^ 
AAAACTGAGGAACTTGTTCAAGCACCTGCTAATGACTCTGGTGACTTTATCGTAGCTGTTACAGC^ 
GTATTGCCCACACTTACATGGCCCAAGAAGCCCTTCAAAAAGTAGCTGCTGAAATGGGGGTTGGTATCAAGGTCG 
AAACCAACGGTGCTAGCGGTGTTGGAAATCAACTAACTGCAGAAGATATCCGTAAGGCTAAAGCTATTATCATTG 
CAGCAGACAAGGCCGTTGAj\ATGGATCGATTTGATGGAAAACCATTGATCAATCGTCCAGTTGCTGACGGTATCC
GTAAGACAGAAGAGCTAATTAACTTGGCTCTTTCAGGAGATACTGAAGTCT 
CTGCAACAGCCTCTAACGAAAAACAAAGCCTTGGTGGTGCCITGTACAA^ 

TGTTACCATTCGTTATCGGTGGTGGTATCATGATTGCCCTTGCCTTCTTGATTGACGGTGCTTTGGGTGTTCCAA 
GAAAACCTTGG CAATC TTGGTTCrrTACCATGAGTTAGCTTCTATGTTCATGAAAATTGGTGGAGCT 
GATGCTTCCAGTCTTTGCGGGTTATGTTGCCTACTCTATTGCTC
GGTGCXATTGCCAAAGAAGGTTTTGCCrTTGGTAAAATTCCTT^ 

CAGGTGTCTCATCTGGTrrCCTAGGTGCCCTTGTTGGTGGATTTATCGCAGGTGCCTTGGTTCTTGCCATCAAGA^ 

TACGTTAAAGTTCCTCGTTCACTCGAAGGTGCTAj^TCAATCCTTCTATTGCCACTTCTTGGAACAA 

GATTTGTTATGCTAGCTGTGAATATCCCAATGGCTGCAATCAACACTGCTATGAATGACTTCCTAGGCGGTCTTGG 

AGGAGGTTCAGCTGTCCTTCTTGGTATCGTCCTTGGTGGAATGATGGCTGTTGACATGGGTGGACCAGTTAATAAA
GCAGCTTATGTCITTGGTACAGGTACGCTTGCAGCAACTGTTTCTTCAGGTGGTTCTGTAGCCATGGCAGCAGTTA 
TGGCTGGAGGAATGCrrGCCACCACTTGCAATCTTTGTCGCAACTCTTCTTTTCAAAGATAAATT^ 
ACGTAACTCTGGTTTGACAAACATCATCATGGGCTTGTCATTTATCACTGAGGGAGCGATTCCATTTGGTGCCGCT 
GACCCAGCTCGTGCGATTCCAAGCTTCATCCTTGGTTCAGCAGTAGCAGGTGGACTCGTTGGTCTTACTGGTATCA 

AACTCATGGCGCCACACGGAGGAATCrTCGTTATCGCCCTTACTTCAAATGCTCTCCTTTACCTCGTT^
GTAGGAGCAATCGTAAGTGGTGTGGTTTATGGTTACCTACGCAAACCACAAGCATAA 

4175.6 

ATGGCAAACAAGAATACAAGTACAACAAGACGGAGACCGTCTAAAGCAGAACTGGAAAGAAAAGAAGCGATTCA 
^^5^^^^"^^^^"^^^^*^^^^^ ^^^^^ ATTT^ A'TT'G ATXTT^CG C AG COTTTC A>\>VTT AOGGG OTG C AGGX AT AACC 
CTTTATAATTTAATTCGCTTGCTAGTGGGTAGCCTAGCTTATC 

TTTCAAGTGGATACGAAAACAGGAAGGACTCTTATCTGGCTTTTTCACCATATTTGCT 

AGGCCT ACTT GGTTTGGAAAT ATGG TTTGGACAAGTCCGTTCTAAAAGGGACCATGGCTCAGGTTGTGACAGATCT 
GACTGGTTTrCGAACGACTAGCTTTGCTGGAGGGGGCTTGATCGGGGTCGCT 
TTTTCAAATATCGGAACTTACTTTATTGGTTCTATCTTGATTTTAGTGGGTTCTCT

TACGATATTGCTGAATTTTTCAGTAGAGGCTTTGCCAAATGGTGGGAAGGGCACGAGCGTCGAAAAGAGGAACG^ 

TTTGTCAAACAAGAAGAAAAAGCTCGCCAAAAGGCTGAGAAAGAGGCTAGATTAGAACAAGAAGAGACT 

AGCCTTACTCGATTTGCCTCCTGTTGATATGGAAACGGGTGAAATTCTGACAG 

CCTATTCCAGAAGAAAAGTGGGTGGAACCAGAAATCATCCTGCCTCAAGCTGAACTTAAATTCCCTGAACAGG^ 
GATGACTCAGATGACGAAGATGTTCAGGTCGATTTTTCAGCCAAAGAAGCCCTTGA^

CAACTCTT TGCAC CAGA TAAA CCAAAAGATCAGTCTAAAGAGAAGAAAATTGTCAGAGAAAATATCAAAATCT^ 
GAAGCAACCTTTGCTAGCTTTGGTATTAAGGTAACAGTTGAACGGGCCGAAATTGGGCCATCAGTGACCAAGTAT 
GAAGTCAAGCCGGCTGTTGGTGTAAGGGTCAACCGCATTTCCAATCTATCAGATGACCTCGCTCTAGCCTTGGCTG 
CCAAAGAT GTCCG GATTGAAGCACCAATCCCTGGGAAATCCCTAATCGGAATTGAAGTGCCCAACTCCGATATTG 
CCACTGTATCTTTCCGAGAACTATGGGAACAATCGCAAACGAAAGCAGAAAATTTCTTGGAAATTCCTTT
AGGCTGTTAATGGAACCGCAAGAGCTTTTGACCTTTCTAAAAT^ 

AGGGAAGTCAGTAGCAGTTA^CGGCATTATTGCTAGCATTCTCATGAAGGCGAGACCAGATCAAGTTAAATTTAT 
GATGGTCGATCCCAAGATGGTTGAGTTATCTGTTTACAATGATATTCCCCACCTCTTGATTCCAGTCGTGACCAAT 
CCACGCAAAGCCAGCAAGGCTCTGCAAAAGGTTGTGGATGAAATGGAAAACCGTTATGAACrCTTTGCCAAG 

GGAGTTCGGAj\TATTGCAGGTTTTAATGCCAj\GGTAGAAGAGTTCAATTCCCAGTCTGAGTACAAGCAAATTCCG
CTACCATTCATTGTCGTGATTGTGGATGAGTTGGCTGACCTCATGATGGTGGCCAGCAAGGAAGTGGAAGATGCTA 
TCATCCGTCTTGGGCAGAAGGCGCGTGCTGCAGGTATCCACATGATTCTTGCAACTCAGCGTCCATCTGTTGATGT 
C ATCT CTGGTTTGATTAAGGCCAATGTTCCATCTCGTGTAGCATTTGCGGTTTCATCAGGAACAGACT^ 
ATTTTGGATGAAAATGGAG CAGAA AAACTTCTTGGTCGAGGAGACATGCTCTrTAAACCGATTGATGA 

CCAGTTCGTCTCCAAGGCTCCTTTATCTCGGATGACGATGTTGAGCGCATTGTGAACTTCATCAAGACTCAGGCAG
ATGCAGACTAC GATGAG AGTTTTGATCCAGGTGAGGTTTCTGAAAATGAAGGAGAATTTTCGGATGGAGATGCTG 
GTGGTGATCCGC1 i n lGAAGAAGCTAAGTCTTTGGTTATCGAAACACAGAAAGCCAGTGCGTCTATGATTCAGCG 
TCGTTTATCAGTTGGATTTAACCGTGCGACCCGTCTCATGGAAGAACTGGAGATAGCAGGTGTCATCGGTCCAGCT 
GAAGGTACCAAACCTCGAAAAGTGTTACAACAATAA 

4176.1 

CTG 1 11 1 ICTAGTT CTCT TGATTTTTGGCTTTTTTGGCTGG 

GCCTGAGGGAGAGTTTTGATGAGAGTGTrCATTTTGGGACTTCGCGTATTCTAGGAAATAGTATCGGTGGACTCTA 
CO TGCCTTGGTc i i ci TCI rATTAAATACCTTTITCCACGAAGCCITTTGGGTGACCrTGGTAGTTGTTCCAATCTGCA
CCATGTTAACCATTATGACAAATGTAGCCATGAATAACAAAGCAGGGGTTATTGGTGGTGTAGCAGCTATGTTAAT

CATTACCCTATCAATTCCAAGTGGTGAGACAATTTTGTACGTGTTTGTGCGTGTATTAGAAACGTTTATGGGAGTT 

TTTGTCGCAATTATCGTAAATTACGATATTGATCGTATTCGTCTCTTTTTAGAGAAAAAAG 

4178.2 

ATGAATAAATCAGAACACCGCCACCAACTTATACGCGCTCTTATCACAAAAAACAAGATTCATACACAGGCTGAG 

TTGCAAGCCCTTCTTGCTGAGAACGACATTCAAGTAACCCAGGCAACCCTCTCACGCGACATCAAAAATATGAAC 

CTATCAAAAGTCCGCGAAGAAGATAGCGCTTATTATGTTCTTAACAATGGTTCCATCTCAAAATGGGAAAAACGTC 

TCGAACTCTACATGGAAGACGCCCTTGTCTGGATGCGCCCAGTTCAACACCAAGTCCTACTAAAAACCCTTCCTGG 

ACTGGCTCAmATCCTTTGGTTCTATCATTGATACTTTGAGCTTCCCTGACGCTATCGCTACCC^ 

TCTGTCTTATCATCTGTGAAGATGCAGATACTGCTCAAAAGTGCTTTGAj\GAACTGAAAAAATTCG 

TTTCTTTGAAGAATAA 

4179.1 

atgaaaagtataaaattaaatgctctatcttacatgggaattcgtgtcttgaatatta nrn cccatcctaactgg 

aacctatgtcgcgcgtgtcttggaccgaactgactatggtracttcaactcagtcgacactattttgtcal'lu'l 

tgccctttgcaacttatggtgtctataactacggtttaag 

taacagaaccttttctagtcttttttatttgtgcatcgcttgtacgattttgacc^ 

atcctctcttctttactgataatccaatcgtcaaaaaggtctaccttgttatggggattcaactca 

ttttcaatcgaatgggtcaatgaagctctggaaaattacagttttctctttracaaaact^ 

gatgctggtctctall 1 icl 1 atttgttaaaaatg aac a cg at attgttgtct at ac a cttgtgatgagtttatcg a 

cgctgattaactacctgatragttatttttggattaaaagagacatcaaacttgtta^ 

taaaccactctttctcccrctgacagccatgttagtctttgccaatgccaatatgctctrcacrr 

tcttcctcgttaaaacagggattgatgtcaacgttagrractataccatagcrcagcgaattgtgaccgttatagc 

tggggttgtaacaggtgcaarrggagtgagtgtgcctcgtctcagttactatctggggaaaggagacaaagaagc 

ctatgtttcrctggttaatagaggtagtcgaatctttaac 11 cl^'l'atcattccactgagttttggactcatggttt 

taggaccaaatgccatcctacrtttacggtagtgaaaaatatatcggaggc^ 

tcgtacgattatc ctggc cn'agataccattcttggttcccaaattctctttaccaatg 

acagtctatacagtctttgctgggctactcaatttgggcrtgaatagtct 

tgaatactacttacrgacaact atgc tatcagagacttctctacttgttttctatatc^ 

aactc atcc acttggg acatatcttt agct atactgttcg atactctct c ri 11 cactttcctttgtagcaatttatt 

TCCTGATTAATTTCGTGTATCCTGTAGATATGGTCATTAATTTGCCA IT1 Tl GATTA 

ATACTGGTTTGATTGTCTTGCTATCAGCTATCTCTTATATTAGTCTA 

GAATTTTTAAACCATGTCCTAGCCTTAAAAAATAAATTTAAAAAATCATAG 

4179.2 

ATGAAACAACTAACCGTTGAAGATGCCAAACAAATTGAATTAGAAATTTTGGATTATATTGATACT 

AGCACAATATCAACTATATTATTAACTACGGTACTCTGArrGGGGCGGTTCGACATGAGGGCTTTATCCCTTGG^ 

CGACGATATTGATCTGTCCATGCCTAGAGAAGACrACCAACGATTTATTAACATTTTrCAAAAGGAAAA^ 

GTATAAGCTCCTATCCTTAGAAACTGATAAGAACTACTTTAACAACTTTATCAAGATAACCGA 

ATTATTGATACTCGAAATACAAAAACCTATGAGTCTGGTATCTrr^^ 

TCCTAAGGTCATTGATACTTGTTATAAACTGGAAAGCTTCAAACT^ 

TATAAGGATAGCCTTTTAAAAGATTGGATACGAACAGCCTTCTGGTTACTCCTTCGACCGGTITCTC 

TGCAAATAAAATCGAGAAAGAAATTCAAAAATATAGTCGTGAAAATGGGCAATATATGGCTTTTATCCC^ 

ATTTAAGGAAAAGGAAGTCTTCCCAAGTCGTACCTTTGATA 

CCrGCACCTGAAAAATTTGATACTATTrrGACACAATTTTATGGAGATTATATGACCCTACCACCAG^ 
GCTTCTACAGTCATGAATTTCACGCTTATAAATTGGAGGATTAG 

4179.3 

ATGATAAAAATGAATCATCTAACCATCACACAAAACAAAGATTTACGAGATCTTGTATCTGACCTAACCATGACC 

ATCCAAGACOTGGAAAAGGTTGCTATTATTGGTGAAGAAGGAAATGGCAAATCAACCrrACTTAAAA 

GGGGAAGCTTTGTCTGATTTCACTATCAAGGGAAACATCCAATCTGACTATCAGTCACTGGCCTACATTCCTCAAA 

AAGTCCCTGAGGACCTAAAAAAGAAA ACTTT ACACGACTAC1 TCn'rn'AGATTCTATTGATTTAGACTACAGTAT 

CCTCTATCGT TTGGC GGAGGAATTGCATTTTGATAGCAATCGTTTCGCAAGTGACCAAGAGATTGGCAATCTATC^ 

GGGGGCGAAGCTTTGAAAATTCAGCTTATCCATGAGTTAGCCAA^CCC^^ 

CAAATGACCTAGACC TTGAGACAG TTGATTGGCTAAAAGGCCAGATTCAmAAAGACCAGGCAAACCGTTATTTTCA 

TTTCCCATG ATG AAG AC rTTCTTI CTG AAACGGCAG AC ACTATTGTTC ACTTGCG ACTGGTC AAAC ACCGT AAAG A 

AGCGGAAACGCTAGTAGAGCATITAGACTATGATAGCrATAGTGAGCAGAGAAAGGCrAATTTTGCCAAACAAAG 

TCAGCAAGCTGCTAACAACCAAAGAGCCTACGATAAAACCATGGAAAAACATCGGAGAGTTAAGCAAAATGTAG 

AAACrGCGCTTCGAGCTACCAAAGATAGTACTGCCGGTCGCCTATTGGCTAAAAAGATGAAAACTGTCCTCTCAC 

AAGAAAAACGCTACGAAAAGGCAGCTCAGTCCATGACTCAAAAGCCACTTGAAGAGGAACAAATCCAACTTTTCr 

TT TCAG ACATCCAACCATTACCAGCrrCTAAAGTCTTAGTCCAACTGGAAAAAGAAAATTTGTCC 

AGTTTTGGTTCAAAAACTACAACTAACTGTCCGTGGCCAAGAAAAAATCGGTATTATCGG 

GCCAAATGGTGTTGGGAAATCAACrcrGTrAGCCAAGTTACAGAGACTTCrGAATGATAAAAGAGAGATTTCACT 
TGGTTTTATGCCACAAGATTACCACAAAAAACTGCAATTGGATTTATCCCCAATAGCCTATCTCAGTAAAACTGGG 






GAAAAAGAGGAACTACAGAAAATCCAATCTCACCTAGCTAGTCTCAATTTCAGTTATCCAGAAATGCAGCATCAA 

ATTCGCrCCTTATCTGGCGGACAACAGGGAAAACTCCTGCTTTTGGATTTAGTCCT 

TGCTGGATGAACCCACACGAAACTTTTCTCCCACTTCTCA^ 

CGGTCTCATCACTGTrTCGCATGACCGTCGTTTCTTAAAAGAAGTCTGCTCGATCATCTATCGCATGACAGAACAC 
GGTTTGAAGCTAGTTAATTTAGAAGATTTATAA 

4179.4 

ATGAAAC CAAAA ACATTTTACAACTTGCTTGCCGAG 

GAACGTTATTT TGAGC TCTTGGTCGAGTGGAATGAGAAGATTAATTTGACGGCGATTACGGACAAGGAA 
TATCTCAAACATTTTTACGATTCGATTGCACCCATTCTTCAAGGTTTGATTCCCAATGAAA 

TATCGGGGCTGGGGCAGGATTTCCTAGTCTACCAATGAAAATTCTCTATCCGGAGTTAGATGTGACCATTATTGAT 

TCACTCAATAAGCGCATCAACTTCCTACAACTCTTGGCTCAAGAACTGGATTTGAACGGAGTTC^ 

GACGTGCCGAj\GATTTTGCCCAAGACAAGAACTTCCGTGCTCAATATGATTTTGTAACAGCTCGTGCGGTTGCCCG 

TATGCAGGTCCTATCTGAATTGACTATTCCCTACCTTAAGGTTGGTGGCAAACT^ 

GCGCCTGAGGAATTATTAGAAGCTAAGAATGCCCTCAATCTCCTTTTT^ 

CCCTACCGAATAGAGATCCGCGCTATATCACAGTGGTAGAAAAGAAAAAAGAAACACCAAATAAATATCCACGTA 
AGGCTGGTATGCCAAATAAACGCCCACTTTAA 

4179.6 

ATGAGTATTAAACTAATTGCCGTTGATATCGACGGAACCCTTGTCAACAGCCAAAAGGAAATCACTCCTGAAGT^ 

TTTCTGCCATCCAAGATGCCAAAGAAGCTGGTGTCAAAGTCGTGATTGCAACrGGCCGCCCTATCGCAGGCGTTGC 

CAAACITCTAGACGACTTGCAGTTGAGAGACGAGGGGGACTATGTGGTAACCTTCAACGGTGCCCTTGTCCAAGA 

AACTGCTACAGGACATGAGATTATCAGCGAATCCTTGACTTATGAGGATTATCTAGATATGGAATTCCTCAGTC 

AAGCTCGGTGTCCACATGCATGCCATTACCAAGGACGGTATCTATACTGCAAATCGCAATATCGGAAAATACACT 

GTACACGAATCAACCCTCGTCAGCATGCCrATCTTCTACCGTACCCCTGAAGAAATGGCTGGCAAAGAAATTGTTA 

AATGTATGTTTATCGATGAA CCAG AAATTCTCGATGCTGCGATTGAAAAAATTCCAGCAGAATTTTAC^ 

CTCCATCAACAAATCTGCTCCTITCTACCTCGAACTCCTTAAAAAGAATC 

TTGGCTGAAAAj\CTCGGATTGACCAAAGATGAAACCATGGCAj\TCGGTGATGAAGAAAATGACCGTGCCATGCTG 
GAAGTCGTTGGAAACCCCGTTGTCATGGAAAATGGAAATCCAGAAATCAAAAAAATCGCCAAATACATCACCAAA 
ACAAATGACGAATCCGGCGTTGCCCATGCCATCCGAACATGGGTACTGTAA 

4179.7 

ATGACTTGGATTATTCTrGGAGTTATCGCTCTTATTGTTATTTTTGTGATTGTTAGCT 
TCGTATGCAAACCAAGGAGGCTTGGAGTCAGATTGATGTTCAGTTGAAACGTCGC^ 

ATTGAGACTGTAAAAGGTTATGCCAAATATGAAGGTTCTACCCTTGAAAAGGTGGCAGAACTACGTAACCAAGTG 

GCGGCAGCGACTTCACCAGCAGAAGCTATGAAAGCCAGTGATGCCCTCACTCGTCAGGTTTCAGGTATTTTTGCAG 

TTGCAGAAAGCTATCCAGATTTGAAAGCTAGTGCTAACITTGTTAAATTGCAAGAG 

ATAAAATTTCTTACTCTCGTCAACTCTATAACAGTGTTGTCA 

CAATATTATCGCTGGAATGTTTGGATTTAAAGCGGCAGATTTCCTTCAAAC 

AAAGTTGATTTTAGCGGTTTAGGTGACTAA 

4179.8 

ATGTTGTTTGATCAAATTGCAAGCAATAAACGAAAAACCTGGATTTTGTTGCTGGTA riTT'I'^ 

TGTTGG TTAT GCGGTTG GTTATC TCTTTATAAGATCTGGACTTGGTGGTTTGGTTATTGCACTGATTATC^ 

TCTACGCTTTGTCTATGATTTTTCAATCGACAGAGATTGTCATGTCCATGAATGGAGCGCGTGAGGTGGATGAGCA 

AACGGCACCAGACCTCTACCATGTAGTGGAAGATATGGCTCTGGTCGCTCAGATTCCTATGCCCCGTGTTTTCATC 

ATTGATGATCCAGCCTTAAATGCCTTTGCGACAGGTTCTAATCCTCAAA 

TACTAGCTATCATGAATCGTGAAGAACTAGAAGCTGTTATGGGACATGAAGTCAGTCATATTCGTAATTATGATAT 
CCGTATTTCGACTATTGCAGTTGCCCTTGCTAGTGCTATCACCATGCTTTCTAGTATGGCAGGTCGTATGATGTGGT 
GGGGTGGAGCAGGTCGCAGACGAAGTGATGATGACCGAGATGGAAATGGTCITGAAATCATTATGCTAGTGGTTT 
CCCTACTAGCTATTGTACTGGCACCTCTCGCTGCAACCTTGGTTCAGCT 

GCAGATGCATCTAGTGTCGAGCTGACTCGCAATCCCCAGGGAATGATTAATGCCCTAGATAAGTTGGACAATAGC 
AAACCTAT GAGTC GCCACGTCGATGATGCTAGCAGTGCCCTTTATATCAATGATCCTAAGAAAGGTGGGGGGTTC 
CAAAAACTCT1TTATACCCACCCACCTATCTCAGAACGGATTGAACGTTTAAAACAGATGTAA 

4179.9 

ATGAAATTAAj\TATTCAAGAAATTCGTAAGCAGTCTGAAGGTTTGAACnTTGAACAAACGTTAGAT^ 

AeCTGCGTGCACGTAATCAAGAAATTTTAGATGTAAAAGATATCCTTGCAGTTGGGAAAGTACAATATGAAGACC 

GTATGTATTTCTTAGATTATCAACTATCTTATACCATTGTTCnTGCTTCGAGTCGCAGTATGGAGCCAGTTGAGTTA 

GTTGAATCITATCCAGTCACGGAAGTTTTCATGGAAGGCGCAACTAACCAGCTAGATCAAGAAGTTTTAGATGATG 

ACTTGGTCTTGCCCATCGAAAATGGGGAGCTTGACCTTGCTGAGAGTGTATCAGACAATATCCTGCTAAACATTCC 

TATCAAGGTCTTGACGGCTGAAGAAGAAGCTGGTCAAGGATTTATCTCAGGAAATGACTGGCAAATCATGACAGA 

GGAAGAATACCAAGCTCAAAAAGCAGTAAAGAAAGAAGAAAACAGTCCTTTTGCTGGCTTACAAGGACTATTTGA 

CGGAGATGAATAA 
4179.12

ATGGAGTTATTTATGAAAATCACAAACrATGAAATCTATAAGTTAAAAAAATCAGGTTTGACCAATCAACAGATT^ 
TGAAAGTGCTAGAJ^TACGGTGAAAATGTTGATCAGGAGCTTTTGTTGGGTGATATTGCAGATATCTCAGGTTGCCG 
TAATCCAGCCGTTTITATGGAACGTTATrrTCAGATAGACGATGCGCATTTGTCGAAAGAGT ^ 
TCTTTCTCTATTTTAGATGACTGTTATCCTTGGGATTTGAGTGAAATATATGATGCGCCT

GGGAAATCTTGACCTCCTGAAJVTTCCCGAAGGTAGCGGTCGTGGGCAGTCGTGCTTGTAGCAuAACAGGGAGCTAA 
GTCAGTTGAAAAAGTCArrCAAGGCTTGGAAAATGAACTGGTTATTGTCAGTGGTCTGGCCAuAGGGCATT GACA C 
AGCAGCTCATATGGCAGCTCTTCAGAATGGCGGAAAAACCATTGCAGTGATTGGAACAGGACrGGATGTGTTTTA 
TCCTAAJVGCCAATAAACGCTTGCAAGACTACATCGGCAATGACCATCTGG*TTCTAAGTGAATATGGACCTGGTGA 
ACAACCTCTGAAATTTCATTTTCCTGCCCGTAATCGCATCATTGCTGGACTTTGTCGTGGTGT GATTG TAGCAGAG
GCTAAGATGCGTTCAGGTAGTCTCATTACGTGTGAGCGAGCAATGGAAGAAGGACGCGATGTCTTTGCTATTCCTG 
GTAGCATTTTAGATGGACTATCAGACGGTTGCCATCATTTGATTCAAGAAGGAGCAAAATTGGTCACCAGTGGGC 
AAGATGTTCTTGCGGAATTTGAATTTTAA 

4181.1

ATGAAACGTCAATTAGCCnTGGTCGTCTTTAGTGGTGGTCAAGATTCAACAACCTGCCTTTTCTGGGTCATG 
ACTATGAAACAGTCGAAGCTGTCACCTTTGCCTACGGCCAACGTCATCACCTCGAAATTCAAATTACTAGAG AAAT 
CGCTAAGGAACAGGGCATTCGTCACCATATCCTCGATATGTCTCTGCTGGGACAAATCACTGCTCAGCCAGACTTT 
GCGACGATTCATATTTCCTACATTCCTGACAAGCTCTGTGTCGAGTCAAAATCCCTCAAACTATATCTATTT 

ACCGAAACCACGGAGATTTCCACGAAAACTGTATCAACACCATCGGGAAAGACTTGGTCAACnTGCTAGACCCTC
GCTATTTAGAAGTCTGGGGAAAATTCACTCCGCGCGGTGGCATTTCAATCGACCCCTACTACAACTACGGTAAGCA 
AGGAACTAAGTATGAGGGCTTGGCAGAACAACGCCTCTTCCAACACGACCTTTATCCAGAGAAAATTGACAACCG 
CTAA 

4181.2

ATGACCGAAACGGTAGAAGATAAAGTAAGTCATTCAATTACTGGGCTTGATATCCTCAAGGGGATAGTTGCTGCG 
GGAGCTGTCATAAGTGGAACCGTTGCAACTCAAACGAAGGTATTTACAAATGAGTCAGCAGTACTTGAAAAAACT 
GTAGAGAAAACGGATGCTTTGGCAACAAATGATACAGTAGTTCTAGGTACGATATCTACAAGTAATTCAGCGAGT 
TCAACTAGTTTGTCAGCTTCAGAGTCGGCAAGTACATCTGCATCTGAGTCAGCCTCAACCAGCGCTTCGACCTCAG 
CAAGTACAAGTGCATCAGAATCAGCAAGTACATCGGCTTCGACAAGTATTTCTGCATCATCTACTGTGGTAGGTrC
ACAAACAGCTGCCGCTACAGAAGCAACTGCTAAGAAGGTCGAAGAAGATCGTAAGAAACCAGCTAGTGATTATGT 
AGCATCAGTTACAAATGTCAATCTCCAATCTTATGCTAAGCGACGCAAGCGTTCAGTGGATrCCATCGAGCAATTG 
CTGGCrrCTATAAAAAATGCTGCTGTTTTTTCTGGCAATACGATTGTAAATGGCGCCC 

TAAACATTGCTAAAAGTGAGACAj^AAGTTTATACAGGTGAAGGTGTAGATTCGGTATATCGTGTTCCAATTTACTA 

TAAATTGAAAGTGACAAATGATGGTTCAAAATTGACCTTTACCTATACGGTTACGTATGTGAATCCTAAAACAAAT
G ATCTTGGTAAT AT ATC AAGT ATGCGTCCTGG AT ATTCT ATCTATAATTCAGGT ACTTCAACACAAAC AATGTTAA 
CCCTTGGCAGTGATCTTGGTAAACCnTCAGGTGTAAAGAACTACATTACrGACAAAAATGGTAGACAGGTTC TATC 
CTATAATACATCTACAATG ACG ACGCAGGGTAGTGGGTATACTTGGGG A AATGGTGCCCAAATG AATGG I I iliu 
GCTAAGAAAGGATATGGATTAACATCATCTTGGACTGTACCAATTACTGGAACGGA 

TACATCCTTTACATTTACCCCTTACGCTGCTAGAACAGATAGAATTGGAATTAACTACTTCAATGGTGGAGGAAAG
GTAGTTGAATCTAGCACGACCAGTCAGTCACTTTCACAGTCTAAGTCACTCTCAGTAAGTGCTAGTCAAAGCGCCT 
CAGCTTCAGCATCAACAAGTGCGTCGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCAACCAGTGCGTCAG 
CTTCAGCAAGTACCAGTGCTTCAGTCTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCCTCGGCTTC 
AGCAAGCACATCAGCATCTGAATCAGCGTCAACCAGTGCTTCGGCTTCAGCAAGTACCAGTGCTTCAGCTTCAGC 

ATCAACCAGCGCCTCGGCCTCAGCAAGCAGCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCGGCCTCAGCAAG
CACCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCAGCCTCAGCATCAACGAGTGCTTCGGCTTCAGCAAGCAC 
AAGCGCCTCGGGTTCAGCATCAACGAGTACGTCAGCTTCAGCGTCAACCAGTGCTTCAGCCTCAGCATCAACAAG 
TGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCAGCATCAACGAGTAC 
GTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGTGCGTCAGCCTCAGCATCGACAAGCGCCTC 

AGCTTCAGCAAGTACCAGTGCTTCAGCCTCAGCGTCGACAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGC
ATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCA 
ACCAGTGCGTCAGCTTCAGCAAGTACCAGTGCTTCAGTCTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACA 
AGTGCCTCGGCTTCAGCAAGCACATCAGCATCTGAATCAGCGTCGACAAGCGCCTCAGCTTCAGCAAGTACCAGT 
GCGTCAGCTTCAGCATCAACCAGCGCCTCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCC 

TCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCGTCAGCCTCAGCATCAACGAGTGCTTCG
GCTTCAGCAAGCACAAGCGCCTCGGGTTCAGCATCAACGAGTACGTCAGCTTCAGCGTCAACCAGTGCTTCAGCC 
TCAGCATCAACAAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCA 
GCATCAACGAGTACGTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGTGCGTCAGCCTCAGCA 
TCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCTTCAGCCTCAGCTCGACAAGTGCGTCGGCCTCAACCAGTG 

CATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCAT
CGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCAACCAGTGCGTCAGCTTCAGCAAGTACCAGTGCTTCAGT 
CTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCcTCGGCTTCAGCAAGCACATCAGCATCTGAATCA 
GCGTCGACAAGCGCcTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGACAAGTGCGTCAGCCTCAGCA 
AGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCATCGGCTTCGGCGTCAACCAGTGCATCAGAGTCAGCAAGT 

ACCAGTGCGTCAGCTTCCGCATCAACAAGTGCCTCGGCTTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTACT
AGCGCCTCAGCCTCAGCCTCAACCAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGT

GCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCGGCTTCAGCGTCAACGAGTGCG 

TCTGAATCGGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCG 

GCTTCAGCATCAACGAGTGCGTCCG CTTCAGCAjVGT ACT AGCG CCTC AGCCTCAGCGTCAACAAGTGCATCGGCTT 

CAGCGTCAACGAGTGCGTCTGAGTCAGCATCAACGAGTGCGTCAGCCTCAGCAAGCACATCAGCTTCTGAATCTG 

CATCAACCAGTGCGTCAGCCTCAGCATCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGT 

CGACAAGTGCGTCGGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGA 

CAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCAT 

CAGCTTCAGCATCAACGAGTGCATCGGCTTCAGCATCAACCAGTGCATCAGAGTCAGCAAGTACCAGTGCGTCAG 

TTCCGCATCAACAAGTGCCTCGGCTTCAGCAAGTACTAG 

4183.1 

ATGGGGGTCGAAACnTGGTTTTATTCTAGCATCTGCTGGCTGGCCATCGGGCT 
ACATGACrGCTGCTAATGGCGGTGGAGGCrrTTTACTAATCTTTCTCATTTCCACrA 
CTGCTGGCTGAGTTTGCCCTTGGCCGTAGTGCrGGCGTTTCCGCT^ 
ACAAGTACAACTTTATCGGTTGGATTGGCGCCTTTGCCCTCTTTATCCTCrr 

TGGATTCTAGTCTATCrAGGTATTGAGTrrGGGAAATTGTTCCAACrrGGTGGAACGGGTGATTATGCT 
TTACTTCAATCATTTCAAATCCAGCCATTGCCCTAGGAGCTCAAGCGGCCTTTATCCT 
TCACGTGGGGTTCAAAAAGGGATTGAAAGAGCrTCGAAAGTCATGATGCCCCTGCTCI^ 
TCATCGGTCGCrCTCrCAGTTTGCCAAATGCCATGGAAGGGGTTCTTTAOTCCrCAAACCAGACll 

gactagcactggtctcctctatgctctgggacaat c n rem i gccctctcactaggggttacagtcatgttgacct 

ATGCTTCnTACTTAGACAAGAAA-ACCAATCTAGTCCAGTCAGGAATCTCCATCGTAGCCATGAATATCTCGATATC 
CATCATGGCAGGTCrAGCCATTTTCCAAGCTCGATCCCCCrTCAATATCCAGTCTGAAGGGGGACCCAGCCTGCTC 
TTTATCGTCITGCCrCAACTCTTTGACAAGATGCClHU'l'GGAACCATTTTCTACGTCCT 

TTTGCGACAGTCACIUl'lTCTGTCGTGATGCTGGAAATCAATGfAGACAATATCACCAACCAGGATAACAGCAAAC 

GTGCCAAATGGAGTGTTATTTTAGGAATTTTGACCTTTGTCTTTGGCATTCCT^ 

GCGGATGTTCACATTTTTGGTAAGACCITCTTTGACGCTATGGACrTC^ 

AGCTCTCTACCnTCACn TI 1 ACAGGCTATATCTTTAAAAAGGCrCTTGCAATGGAGGAACTCCATCTCGATGAA 
AG AGCATGG AAAC AAGG ACTGTTCC AAGTCTGGCTCTTC C [TCT 1 'CG T rTCT 1'CGTTTCGTCATTCCAATC ATCATC 
ATTGTGGTCTTCATTGCCCAATTTATGTAATCAAAAAGGACTTGAGTAG 

4183.5 

ATGTTGAAAAAATGGCAGTTAAAAGATGTTATCTTGCrrGCTTTCTO 
TTCAGGATATGTGTATAATATTCTCAGTCrACTCTTAACACCrCTTGGTTTGCAGGC 

TCGGTCTCTGGTGTATGGCTGCGCCCATrGCrGCCATCTTTGTTCCGAGAGTCGGAAGTGCAACGATTGGAGAAGT 

GCTAGCrGCGCITGCrGAAGTGCTTTATGGTAGCCAATTTGGTCrAGGAGCTCTTTTG 

TGGGAAGTGAATTTGGTTTTATCGTAACTAAGAATCGCTATGAAAGTTGGCTCTCrCTAACTGCT 

GATTACGCTTGTTAGCTTTGTCTATGAATACATTAAGTTAGGTTACTACGCCTT^ 

GCrTGTGGTACGTTTTATTTCTGTTTATTTCTTCTGTACCATCTTGGTTCGTG 

TGCAACTGGAGGAAAAGCATAG 

4183.6 

ATGGTCAAAGTAGCAACCCAGACACCGATTATCAGTCTCITCrTGCTGATTTTATC 

TTCGATTGCTCTGACrCTTTCGGTAGTCGCATTTTGTATTCTCTTTATGCT 

CTTGGATGATCATACITGCCATTTTACCATCTITTGCC^ 

GGCAGTCATGCnTGGAACGAGGGCCTTTGTGACAGTTTGTATCGGCCTTGTCTITGTTTCCrCT 
AGCTTCTCTTGTACITGGCTCAAAAGGGGCTATCACGCrCTTGGTCCT 

CCTCT CATTCAGCAAGAAATCAAGTCCCTCAAAG A AG CTTGCCT ATT ACGTGGTCA AG AACTACATT^ 
CCTTGATTTACAGTAAGGTTCTGATGACAGTCTTTAGGTGGCGCCATCTTTACCrGAGAGCTCr 
ATATGACGAA CATGCACAGTTGAAGAATAGCTATCGGACiri'1'1 ATATTCCTAAAAAAACAAAATTAATCTACCTG 
CTTTTCrTTTTATTGCrrCAAACCAGTCTATTTTTATAA 

4183.7 

ATGA GAAA GCACCAATTACAAGTTCACAAATTAACCATTTTATCTATGATGATTGCCCTTGATGTAGTCCTTACAC 
CTATCTTTCGAATTGAGGGAATGGCACCGATGTCCAGTGTAGTCAATATTCTAGCAGGAATCATGATGGGACCTGT 
TTATGCCrTGGCTATGGCrACAGTCACAGCCTTTATCCGTATGACGACT 
GGAGCGACTITTGGAGCCCrrCTAGCAGGTCTCriTTAT 

AGATTTTGGGAACAGGTATTATTGGTTCCATTGTTrCCTATCCTGTTATGGTACTCT 
CTTAGCTGGTTTATCTACACGCCTr 

ATTTTTAATCAAGCAGGAATTCnTTAAAAAAGTGCAGGGATATTTCTITAGTGAAAGGATAGACrGA 
4183.8 

ATGCAGGAATTTACAAATCCCTTTCCTATAGGCTCTAGTTCCCTCATTCA 
GATGCTGGCAAATGGGATTTTGGCTCrGGGATGCAAACCTGTCATGGCA 

ACTAAGCAAAGTCAGGCTCTCTTCATCAATTTGGGGCATTTGTCAGCTGAGAAGGAAAAAGCAATCCGCATGGCA 
GCTTCGTATGCAAACCAATCTTCTCTCCCGATGGTAGTAGATGCGGTTGGCGTAACGACTTCATCCATTCGTAAGA
GCITAGTTAAAGACCrTTTAGACTATAGACCTACGGTCCTT^ 

ATTAAAGCACCACGGCGTTGGGGTCGATGCGAGTGCTAAAGATCAAGAAACGGAGGATTTGCTTCAAGTCTTGAA 
AGACTGGTGTCAG ACCTATCCTGGTATGT C 1 1 1 C 1 "1 AGTCACAGGTCCCAAGG ACCTCGTCGTTTCG A AAAATCAG 
GTCGCTGTACTGGGAAATGGCTGTACTGAATTAGACTGGATAACAGGGACAGGAGACTTGGTTGGAGCCTTAACA 
GCTGTTTTTCTCAGCCAAGGAAAGACTGGTTTTGAAGCTTCTTGCTTAGCAGT 

GAAAATAGTTGTTCAAGGAATGGGATTGGAAGAATTTCGTTACCAAGTACTCAATCAGCTTTCGCTCCT^ 
AGATGAAAATTGGCTAGATACCATCAAAGGAGAGGTTTATGAATAG 



4185.3 

ATGAACCATAAAATCGCAATTTTATCAGATGTTCATGGCAATGCGACGGCGCTAGAAGCAGTGATTGCAGATGCT 

AAAAATCA^GGGGCCAGTGA^TATTGGCTTCTGGGAGATA IUU ' IULUU 'CCTGGTCCAGGCGCAAATGACTTAGTCG 

CCCTGCTAAAGGACCTTCCTATCACAGCAAGTGTTCGAGGCAATTGGGATGATCGTGTCCTTGAGGCITTAGATGG 

GCAATATGGCTTAGAAGACCCACAGGAAGTTCAGCTCTTGCGTATGACACAGTATTTGATGGA GCGAA TGGATCC 

TGCAACGATTGTCTGGCTACGAAGCTTGCCTTTGCTGGAAAAGAAAGAAATTGACGGATTGCGCITTT 

CATAATTTACCTGACAAAAACTATGGTGGTGACTTGCTAGTTGAGAATGATACAGAGAAATTTGACCAACTGCTA 

GATGCGGAAACGGACGTGGCAGTTTATGGTCATGTTCACAAGCAGTTGCTTCGTTATGGAAGTCAAGGGCAACAA 

ATCATCAATCCAGGGTCGATTGGCATGCCCTATTTTAATTGGGAGGCGTTAAAAAATCACCGTTCCCAGTATGCCG 

TGATAGAAGTTGAAGATGGGGAATTACTCAATATCCAATTTCGTAAAGTTGCTTATGATTACGAAGCTGAGTTAGA 

ATTGGCCAAGTCCAAGGGGCTTCCCTTTATCGAAATGTATGAAGAACTGCGTCGTGACGATAACTATCAGGGGCA 

C AATCTG G A ATT ATTAG CC AG CTT AAT AG A A AAG C ATG GGT ATGTAG AGG ATGTG A AG AATTTTTTTG ATTTTTTG 

TAA 

4186.1 

ATGAATGTAAATCAGATTGTACGGATTATTCCrACTTTAuAAAGCTAATAATAGAAAATTAAA^^ 
TTGAAACCCTTGGAATGAAGGCCnTGTTAGAAGAATCGGCCTTTCTGTC^ 

GCTGGTTTTAGAAGAAGCTCCCAGTATGCGTACTCGTAAGGTAGAGGGAAGAAAAAAACTAGCTAGATTGATTGT 
CAj\GGTGGAAAATCCCTTAGAA^TTGAAGGAATCTTATCTAAAACAGATTCGATTCATCGATTATATAAAGGTC^ 
AAATGGCT ACGC IT H G AAATTTTCTCACCAG AAG ATG ATTTG ATTTTG ATTCATGCGG A AG ATG ACAT AGCAAGT 
CTAGTAGAAGTAGGAGAAAAGCCTGAATTTCAAACAGATTTGGCATCAArri'Cl'l I AAGTAAATTTGAGATTTCTA 
TGGAATTACATCTCCCAACTGATATCGAAAGTTTCTTGGAATCATCTGAA^^ 

AGCTCAGGGGCAGGATTTGACTGTGGACAATACGGTTACCTGGGACTTATCTATGCTCAAGTTCTTGGTCAATGAA 

TTAGACATAGCAAGTCTTCGCCAGAAGTrTGAGTCTACTGAATATTTTATTCCTAAGTCT 

GTAAAGATAGAAATAATGTTGAATTGTGGTTTGAAGAAGTATGA 

4186.2 

ATGAAGTGGACCAAGATTATTAAAAAAATAGAAGAACAAATCGAGGCAGGGATTTATCCCGGAGCCTCTTTTGCG 

TATTTrAAGGACAATCAATGGACAGAGTTCTATTTAGGCCAGAGTGACCCAGAGCATGGCTTGCAGACTGAGGCA 

GGACTAGTTTATGACCTAGCTAGTGTCAGCAAGGTTGTTGGGGTTGGCACAGTTTGTACCTTCTTGTGGGAAATAG 

GTCAATTAGATATTGATAGACTGGTAATAGATTTTTTACCTGAGAGTGATTATCCAGACATCACTATrCGCCAGCT 

CTTG ACTC ATG C AA C AG ACCTTG ATCC IT 1 ' 1 ATTCCT A ATCGTG AT C IT I 1 AACAGCCCCTGAATTAAAGGAAGCG 

ATGTTTCATCTCAACAGACGAAGTCAGCCAGCCTTTCTrrATTCGGATGTCCAlTTTTTGCTGTTGGGCT^ 

GGAAAGAATTTTTAATCAAGATTTGGATGTGATTITAAAGGATCAAGTCTG 

TAAGTTTGGGCCAGTTGAGCTTGCTGTTCCAACAGTTAGAGGTGTAGAGGCAGGCATAGTGCATGATCCCAAGGC 
TCGTCTCCTGGGTAGACATGCTGGGAGTGCTGGTTTATTTTCGACTATAAAGGATTTACAAATCri TITAGAACAC 
TATTTAGCAGATGATTTTGCAAGAGACTTAAATCAAAA rrrn'CTCCTTTGGATGACAAGGAACGTTCTTTAGCAT 
GGAATTTGGAAGGAGATTGGCTAGACCATACGGGCTATACAGGTACCTTTATCATGTGGAATCGTCAGAAGCAAG 
AAGCCACTATTTTCCTATCGAATCGTACCTATGAAAAGGACGAGAGAGCTCAATGGATATTAGACCGCAATCAAG 
TGATGAACTTGATTCGCAAAGAAGAGTAA 

4187.2 

ATGATGAAGAAGACTTATAATCATATTTTGGTCTGGGGAGTCATTTTCTATAGCATTTGCATTGTCTG n 1 1 1GCTT 
TACTCCTCAAGAACAATCTACCGTGGGAGTGGGAACTCCAGGTATTCAGCATCTTGGACGCCTGGri'l ri Cl'l'l'l G 
ACTCCTTTCAATTCTCTCTGGAAACTGGGCGAAGTGAGTGACATTGGACAATTATGTTGGA rTTTTTTACAAAATA 
TCCTCAATGTCTTCTTGTTTTTTCCTCTGATTTTC 

GTCCTTCTTTTT AG 1'IT 1'C 1' 1 GTG AGTCTTGG AATCG AGTGT ACGC A ATTAATCTTGG AC rmTCTTl G ATTTCAAT 

CGCGTCTTTGAGATTGATGATTTGTGGACCAACACTTTGGGTGGCTATCTGGCTTGGCT 

ATAAAAACAAGGTAAGGAATTAA 
4188.1

ATG A AG ATTCCTCTCTTA ACTTTTGC AAGG C AT A A ATTTGTTT ATGTCTTG CTT ACTTTG CTTTTTCTTGCTTTGGTT 
TATCGTGATGTTTTGATGACTTAl I 1 C 1 1 1 1 I I GATATTCATGCGCCCGATCTAGCTAAATTCGATGGACAAGCAAT 
T AA AAATG ACTT ATTAA AATCAGC ATT AG ATTTTCGTATTCTCCAGTTC AATCT AGG IT 1 ' 1 T ATC AATCATTT ATT A 
TTCCAATCATCATTG rrrrGCTAGGTTTTCAATATATTGAGCTGAAAAATAAAG nTl'ACGATTGAGTATTGGAAG 
AGAAGTGAGTTATCAAGGGTTAAAAAGAAAGTTGACTTTGCAAGTTGCAAGTATCCCTTGTTTGATAT 
ACTGTGCTGATAATTGCAATTATAACCTAl'l I Cr rTGGGACIl'llUCTCCTCTTGGATGGAATTCTCTATTTTCTGAT 
GGAAGTGGTTTACAAAGACTCCTAGATGGAGAGATAAAAAGCTATTTGTT(mTACTTGTGTCCT 
TTTTCATCAATGC AATCTA I ' I 1 l" I rACAAATAGTTGATTATGTGGGGAATGTGACTCGTTCGGCAATCACCTATTTG 
ATG n'TClTlGGCTTGGTTCTATGCTGCrTTATAGTC 
AGCTAGCTATGGGGATGTAAGTTTGATGAAACTCTTTACrCCX^ 
AAAAATATGAAGATAATGTTTAA 

4188.2 

ATGAAGATAATGTTTAAGAATTTTAACAATATTTrGCTAAATAGAAAGATTGTTTTACT 

GATGATTTTGATAAACCATCTATTGTCAACAGCGGTTCAAAAGCAGGATGCTGTTATCl ITTI CAAGAGAGAATTG 
ATrrCAATTrrTTCCTATAATGACTATTCTGAAGCGAATTTAGAAATCCCCAAACT 

CATGGTAGGATGGCTCTCTGTCATTTTACTTGAAAGTGATTTGGCAGACCATTACCATCACTTGATTCGCTATC 
TCAAGCTCCl'l'rr ICG ATTATAC AAGG AAACGATTGGTTGTCATTTCTAAA 1T1T1TACTCAAGATTTGTTTGTCTG 
GriUCll'GGTTTACTTCCTCTAGGAATTCATTTCAAAACAGTCGCACrL'l'l'C I'l'l'l'l' ACTTGCTCAGTTAATGATGTT 
GTACTTACTACTGTCTTATCTGATAGCACTGATTAGTGCGGGCGCTGG rrri ' l C Crrrri r CTCTATTTTTTAGCATT 
TGTGGGACAAGAATGGATGATGGATCATATTGTAACAGTGTATTTAGTACTCTTAAGTTTATTAGTTATGTTGATT 
GTTAGTCGCTTGGAAGAGAAATTTAAGAAAGGATAA 

4188.5 

ATGGGCAAAGGAGAGATGGGCAAAGGAGTTATTGGCTTGGAGTTCGACTCAGAAGTATTGGTCAACAAGGCTCCA 
ACCCTTCAATTGGCAAATGGTAAAACAGCGACTTTCCTAACCCAGTATC 

ATAAGGAAGATATCGGACAGGAAATTATTGGTATAGCTAAAGGAAGCATCGAAAGTATGCATAATCTTCCTGTAA 
ATCTAGCAGGTGCCAGAGTTCCTGGCGGAGTAAATGGTAGCAAAGCAGCGGTGCATGAAGTTCCAGAATTTACAG 
GGGGAGTTAATGGTACAGAGCCAGCTGTTCATGAAATCGCAGAGTATAAGGGATCTGATTCGCTTGTAACTCTTAC 
TACAAAAAAAGATTATACTTACAAAGCTCCTCTTGCTCAGCAGGCACTTCCTGAAACAGGAAACAAGGAGAGTGA 
CCTCCTAGCTTCACTAGGACTAACAGC I'rrCriCCTTGGTCTGTTTACGCTAGGGAAAAAGAGAGAACAATAA 

4188.10 

ATGTTTAAAGTITTACAAAAAGTrGGAAAAGCTTTTATGTTACCTATAGCTATACTTCCTGCAGCA^ 

GGGGATTGGTGGTGCACTTTCAAACCCAACCACGATAGCAACTTATCCAATACTAGACAATAGTATTTTTCAATCA 

ATATTCCAAGTAATGAGCTCTGCAGGAGAGGTTGTATTCAGTAATTTGTCACTACTTCTCTGTGTGGGATTATGTA 

TTGGCTTAGCGAAACGAGATAAAGGAACCGCTGCGTTAGCAGGAGTA^CTGGTTACTTAGTTATGACTGCAACGA 

TCAAAGCTTTGGTAAAACriMnUATGGCAGAAGGATCTGCAATTGATACTGGAGTTATTGGAGCATTAGTTGTCGG 

AATAGTTGCCGTATATTTGCACAACCGATATAACAATATTCAATTACCITC^ 

CGCTTCGTrCCTATTGTTACATCGTTCTCTTCTATCTTGATTGGCTTTGT C 1 ' l ' C T 11 GTTATTTGGCCACCTTTCCAA 
CAACTTCTTCrrrTCrACAGGTGGATATATTr^ 

ACTTTCTGGAGCAGTAGGCITACATCATATAATrTACCCTATGTTTTC 

TTGCAGG ACAAACAGTGGTTGGAGCTCAA AAAATA 1 ' 1 1 1 1 ' 1 GCTCAATTAGCCG ATTTGGCCCATTCTGGATTATT 

TACAGAAGGAACAAGGTTTTrTGCAGGTCGTrTCrCAACAATGATGTTCGGTTTAC 

TACCATAGTGTTCCTAAAAATCGTCGTAAAAAATACGCGGGTTTGTTTTTTGGAGTTGC^^ 

CGGTATTACAGAACCAATTGAATTTATGTTrCTATrCGTCAGTCCGGTTCTATATGTTGTrCACGCATTCCTTGA 

GTGTTAGCrrCTTrATTGCAGACGTCTTAAATATTTCAATAGGAAACACATTTTCAG 

TTATTTGGAATTTTGCAGGGGAACGCTAAGACGAATTGGGTTCTTCA 

GTATTATATTATTTTTAGATGGTTCATTACTCAATTCAACGTTCTAACGCCAGGGCGAGGAGAAGAAGTAGATrCrr 
AAAGAAATTTCTGAATCCGCAGATTCAACTTCAAATACTGCAGATTATTTAAAACAGGATAGCCTACAAATTATCA 
GAGCCTTGGGTGGATCAAATAATATAGAAGATGTAGATGCTTGTGTGACACGTTTACGTGTAGCTGTAAAAGAAG 
TTAATCAAGTTGATAAAGCACTTTTAAAACAAATTGGTGCAGTTGATGTCTTAGAAGTGAAGGGTGGCATTCAA 
AATCT ATGGAGCAAAAGCAATCTTATATAAAAATAGT ATT AATGAAATTTTAGGTGTAGATGATTAA 

4188.11 

ATGAAATTTAGAAAATTAGCTTGTACAGTACnTGCGGGTGCTGCGG 

GCGGAAGTAAAGATGCTGCCAAATCAGGTGGTGACGGTGCCAAAACAGAAATCACTTGGTGGGCATTCCCAGTAT 

rTACCCAAGAAAAAACTGGTGACGGTGTTGGAACITATGAAAAATCAATCATCGAAGCGTTTGAAAA^ 

CAGATATAAAAGTGAAATTGGAAACCATCGACTTCAAGTCAGGTCCTGAAAAAATCACAACAGCCATCGAAGCAG 

GAACAGCTCCAGACGTACTCTTTGATGCACCAGGACGTATCATCCAATACGGTAAAAACGGTAAATTGGCTGAGT 

TGAATGACCTCITCACAGATGAATTTGTTAAAGATGTCAACAATGAAAACATCGTACAAGCAAGTAAAGCTGGAG 

ACAAGGCTTATATGTATCCGATTAGTTCrGCCCCATTCTACATGGCAATGAACAAGAAAATGTTAGAAGATGCTGG 

AGTAGCAj^ACCrTGTAAAAGAAGGTTGGACAACTGATGATTTTGAAAAAGTATTGAAAGCACTTAAAGACAAGGG 

TrACACACCAGGTTCATTGTTCAGTrCrGGTCAAGGGGGAGACCAAGGAACACGTGCCrTTATCTCTAACCm 
AGCGGTTCTGTAACAGATGAAAAAGTTAGCAAATATACAACTGATGATCCTAAATTCGTCAAAGGTCTTGAAAAA

GCAACTAGCTGGATTAAAGACAATTTGATCAATAATGGTTCACAATTTGACGGTGG 

GCCAACGGTCAAACATCTTACACAATCCTTTGGGCACCAGCTCAAAATGGTATCCAAGCTAAACTm 

AGTAAGGTAGAAGTGGTAGAAGTACCATTCCCATCAGACGAAGGTAAGCCAGCTCTTGAGTACCTTGTAAACGGG 

TTTGCAGTATTCAACAATAAAGACGACAAGAAAGTCGCrGCATCTAAGAAATTCATCCAGTTTATCGCAGATGAC 

AAGGAGTGGGGACCTAAAGACGTAGTTCGTACAGGTGCTTTCCCAGTCCGTACTTCATTTGGAAAACTTTATGAAG 

ACAAACGCATGGAAACAATCAGCGGCTGGACTCAATACTACTCACCATACTACAACACTATTGATGGATTTGCTG 

AAATGAGAACACTTTGGTTCCCAATGTTGCAATCTGTATCAAATGGTGACGAAAAACCAGCAGATGCTT^ 

CCTTCACTGAAAAAGCGAACGAAACAATCAAAAAAGCTATGAAACAATAG 

4188.12 

ATGCAATCTACAGAAAAAAAACCATTAACAGCCTTTACTGTTATTTCAACAATCATTTTGCTCT^ 
TGTTCATCTTTCCATTCTACTGGATTTTGACAGGGGCATTCAAAT 

TGGTTCCCTAAAATGCCAACCATGGAAAACTTCCAACAACTCATGGTGCAGAACCCTGCCTTGCAATGGATGTGG 

AACrCAGTATTTATCTCATTGGTAACCATGTTCTTAGTTTGTGCAACCTCATCTCTAGCAGGTTATGTATT 

AAAACGTTTCTATGGTCAACGCATTCTATTTGCTATCTTTATCGCT 

TACCATTGGTACGTATCGTCAACTTCATGGGAATCCATGATACTCTCTGGGCAGTTATCTTGCCTTTGATTGGATG 
GCCATTCGGTGTCTTCCTCATGAAACAGTTCAGTGAAAATATCCCTACAGAGTTGCTTGAATCAGCTAAAATCGAC 
GGTTGTGGTGAGATTCGTACCTTCTGGAGTGTAGCCTTCCCGATTGTGAJ\ACCAGGGTTTGCAGCCCTTGCAATCT 
TTACCITCATCAATACTTGGAATGACTACITCATGCAATTGGTAATG^ 

CTTGGGGTTGCGACCATGCAGGCrGAAATGGCAACCAACTATGGTTTGATTATGGCAGGAGCTGCCCTTGCTGCTG 
TTCCAATCGTCACAGTCTTCCTAGTCTTCCAAAAATCCTTCACACAGGGTATTACTATGGGAGCGGTCAAAGGATA 
A 

4191.1 

ATG AAAAAAAC rTTTTTCT 1 ACTGGTGTT AGGCTTGTTTTGCCTTCTTCCACTCTCTG r I 'IT I GCCATTG ATTTCAAG 

ATAAACTCTTATCAAGGGGATTTGTATATTCATGCAGACAATACGGCAGAGTTrAGACAGAAGATAGTTTACCAGT 

TTGAGGAGGACTTTAAGGGCCAAATCGTGGGACTTGGACGTGCTGGTAAGATGCCTAGCGGGTTTGACATTGACC 

CTCATCCAAAGATTCAGGCCGCGAAAAACGGTGCAGAACTAGCAGATGTGACTAGCGAAGTAACAGAAGAAGCG 

GATGGTTATACTGTGAGAGTCTATAATCCAGGTCAGGAGGGCGACATAGTTGAAGTTGACCTCGTCTGGAACTTA 

AAAAATTTAC l" riUCCTTTATGATGATATCGCTGAATTAAATTGGCAACCTCTGACAGATAGTTCAGAGTCTATTG 

AAAAGTTTGAATTTCATGTAAGGGGAGACAAGGGGGCTGAAAAACTCnTTTrCCATACAGGGAAACTT^ 

AGGGAACGATTGAAAAGAGTAACCTTGATTATACTATCCGTTTAGACAATCTTCCGGCTAAGCGTGGAGTTGAGTT 

GCATGCCTATTGGCCTCGGACCGATTTTGCTAGCGCTAGGGATCAGGGATTGAAAGGGAATCGTTTAGAAGAGTTT 

AATA AGATAGAAGACTCGATTGTTAGAGAAAAAGATCAGAGTAAACAACTCGTTACTTGGGTCCTCCCTTCGATC 

CTTTCCATCTCCTTGTTATTGAGTGTCTGCITCrAT^ 

CAAAAATCATCGTCTCTATGAACCACCAATGGAATTAGAGCCTATGGTTTTATCAGAAGCAGTCTACTCGACCTCC 

TTGGAGGAAGTGAGTCCCrrGGTCAAGGGAGCTGGAAAATTCACCTTTGATCAACTTATTCAAGCTACCTTG 

ATGTGATAGACCGTGGGAATGTCTCTATCATTTCAGAAGGAGATGCAGTTGGTTTGAGGCTAGTAAAAGAAGATG 

GTTTGTCAAGCTTTGAGAAAGACTGCCTAAATCTAGCrVI i rCAGGTAAAAAAGAAGAAACTCrri'CCAATTTGTT 

TGCGGATTACAAGGTATCTGATAGTCTTTATCGTAGAGCCAAAGTTTCTGATGAAAAACGGATTCAAGCAAGAGG 

GCTTCAACTCAAATCl'l'ClU'llGAAGAGGTATTGAACCAGATGCAAGAAGGAGTGAGAAAACGAGTTTCCTTCTGG 

GGGCTCCCAGATTATTATCGTCCTTTAmACTGGTGGGGAAAAGGCCTTGCAAGTGGGTATGGGTGCCTTGACTATCC 

TGCCCCTATITATCGGATTTGGTTTGTTCTTGTACAGTTTAGA 

CTTGGTTTTCTAGGGTTAGTTTTGTCTGTTTTCT 

TGAAGCGGGAGCTGAGGTCTACTATCTCTGGACCAGTTTTGAAAATATGTTGCGTGAGATTGCACGATTGGATCAG 
GCTGAACTGGAAAGTATTGTGGTCTGGAATCGCCTCITGGTCTATGCGACCrTTATTTGGCTATGCGGACAAGGTTA 
GTCATTTGATGAAGGTTCATCAGATTCAAGTGGAAAATCCAGATATCAATCTCTATGTAGCTTATGGCTGGCACAG 
TACGTTTTATCATTCAACAGCACAAATGAGCCATTATGCTAGTGTCGCAAATACAGCAAGCACCTACTCTGTATCT 
TCTGG AAGTGG AAGTTCTGGTGGTGGCTTCTCTGG AGGCGG AGGTGGCGGCAGTATCGGTGCC 11 1 1AA 

4191.2 

ATGAAAAA AGTAA GAAAGATATTTCAGAAGGCAGTTGCAGGACTGTGCTGTATATCTCAGTTGACAGCri'1'ri'Cl'l 

CGATAGTTGCTTTAGCAGAAACGCC TGAA ACCAGTCCAGCGATAGGAAAAGTAGTGATTAAGGAGACAGGCGAAG 

GAGGAGCGCTTCTAGGAGATGCCGTCTTTGAGTTGAAAAACAATACGGATGGCACAACTGTTTCGCAAAGGACAG 

AGGCGCAAACAGGAGAAGCGATATTTTCAAACATAAAACCTGGGACATACACCTTGACAGAAGCCCAACCTCCAG 

TTGGTTATAAACCCTCTACTAAACAATGGACTGTTGAAGTTGAGAAGAATGGTCGGACGACTGTCCAAGGTGAAC 

AGGTAGAAAATCGAGAAGAGGCTCTATCTGACCAGTATCCACAAACAGGGACTTATCCAGATGTTCAAACACCTT 

ATCAGATTATTAAGGTAGATGGTTCGGAAAAAAACGGACAGCACAAGGCGTTGAATCCGAATCCATATGAACGTG 

TGATTCCAGAAGGTACACTTTCAAAGAGAATTTATCAAGTGAATAATTTGGATGATAACCAATATGGAATCGAATT 

GACGGTTAGTGGGAAAACAGTGTATGAACAAAAAGATAAGTCTGTGCCGCTGGATGTCGTTATCTTGCTCGATAA 

CTCAAATAGTATGAGTAACATTCGAAACAAGAATGCTCGACGTGCGGAAAGAGCTGGTGAGGCGACAGGTTCTCT 

TATTGATAAAATTACATCTGATTCAGAAAATAGGGTAGCGCrTGTGACrrATGCTTCCACTATCTTTGATGGGACC 

GAGTTTAC AGTA GAAAAAGGGGTAGCAGATAAAAACGGAAAGCGATTGAATGATTCT Ciri 11 TGGAATTATGAT 

CAGACGAGTTTTACAACCAATACCAAAGATTATAGTTATTTAAAGCTGACTAATGATAAGAATGACATTGTAGAAT 
TAAAAAATAAGGTACCTACCGAGGCAGAAGACCATGATGGAAATAGATTGATGTACCAATTCGGTGCCACTTTTA
CTCAGAAAGCTTTGATGAAGGCAGATGAGATTTTGACACAAC^ 

ATATTACGGATGGTGTCCCAACTATGTCGTATCCGATTAATTTTAATCATGCTACGTTTGCTCCATCA 
CAACTAAATGCATTTTTTAGTAAATCTCCTAATAAAGATGGAATACTATTAAGTGATTTTATT 
GTGGAGAACATACAATTGTACGCGGAGATGGGCAAAGTTACCAGATGTTTACAGATAAGACAGTTTATGAAAAAG
GTGCTCCTGCAGCTITCCCAGTTAAACCTGAAAAATATTCT 

TCCAATTAATGGTGGATATATTTGGCITAATTGGAGAGAGAGTATTCTGGCrrTATCCGTTT 

AAATTACCAATCATGGTGACCCTACAAGATGGTACTATAACGGGAATATTGCTCCTGATGGGTATGATGTCTTTAC 
GGTAGGTATTGGTATTAACGGAGATCCTGGTACGGATGAAGCAACGGCTACTAGTTTTATGCAAAGTATTTCTAGT 

AAACCTGAAAACTATACCAATGTTACTGACACGACAAAAATATTGGAACAGTTGAATCGTTATTTCCACACCATC
GTAACTGAAAAGAAATGAATTGAGAATGGTACGATTACAGATCCGATGGGTGAGTTAATTGATTTGCAArrGGGC 
ACAGATGGAAGATTTGATCCAGCAGATTACACTTTAACTGCAAACGATGGTAGTCGCTTGGAGAATGGACAAGCT 
GTAGGTGGTCCACAAAATGATGGTGGTTTGTTAAAAAATGCAAAAGTGCTCTATGATACGACTGAGAAAAGGATT 
CGTGTAACAGGTCTGTACCTTGGAACGGATGAAAAAGTTACGTTGACCTACAATGTTCGTTTGAATGATGAGTTTG 

TAAGCAATAAATTTTATGATACCAATGGTCGAACAACCTTACATCCTAAGGAAGTAGAACAGAACACAGTGCGCG
ACTTCCCGATTCCTAAGATTCGTGATGTGCGGAAGTATCCAGAAATCACAATTTCAAAAGAGAAAAAACTTGG 
ACATTGAGTTTATTAAGGTCAATAAAAATGATAAAAAACCACTGAGAGGTGCGGTCTTTAGTCTTCAAAAACAAC 
ATCCGGATTATCCAGATATTTATGGAGCTATTGATCAAAATGGCACTTATCAAAATGTGAGAACAGGTGAAGATG 
GTAAGTTGACCTTTAAAAATCTGTCAGATGGGAAATATCGATTATTTGAAAATTCTGAACCAGCTGGTTATAAACC 

CGTTCAAAATAAGCCTATCGTTGCCTTCCAAATAGTAAATGGAGAAGTCAGAGATGTGACTTCAATCGTTCCACAA
GATATACCAGCGGGTTACGAGTTTACGAATGATAAGCACTATATTACCAATGAACCTATTCCTCCAAAGAGAGAA 
TATCCrCGAACTGGTGGTATCGGAATGTTGCCATTCTATCTGATAGGTTGCATGATGATGGGAGGAGTTCTATTAT 
ACACACGGAAACATCCGTAA 


4191.3 

atgaaatcaatcaacaaatttttaacaatgcttgctgccttattactc 

cagtttttgcggctgggacgacaacaacatctgttaccgttcataaactattggcaacagatggggatatggata 
aaattgcaaatgagttagaaacaggtaactatgctggtaataaagtgggtgttctacctgcaaatgcaaaagaaa
ttgccggtgttatgttcgtttggacaaatactaataatgaaattattgatgaaaatggccaaactctaggagtgaa 
tattgatccacaaacatttaaactctcaggggcaatgccggcaactgcaatgaaaaaattaacagaagctgaagg 
agctaaatttaacacggcaaatttaccagctgctaagtataaaatt^ 

ggtgaagatggagcaaccttaacaggttctaaagcagttccaattgaaattgaattaccattgaacgatgttgtg 
gatgcgcatgtgtatccaaaaaatacagaagcaaagccaaaaattgataaagatttcaaaggtaaagcaaatcca
gatacaccacgtgtagataaagatacacctgtgaaccaccaagttggagatgttgtagagtacgaaattgttaca 
aaaattccagcacttgctaattatgcaacagcaaactggagcgatagaatgact 

GGTACAGTGAAAGTAACTGTTGATGATGTTGCACTTGAAGCAGGTGATTATGCTCTAACAGAAGTAGCAACTGGTT 
TTGATTTGAAATTAACAGATGCTGGTTTAGCTAAAGTGAATGACCAAAACGCTGAAAAAACTGTGAAAATCACT^ 

ATTCGGCAACArTGAATGACAAAGCAATTGTAGAAGTACCAGAATCTAATGATGTAACATTTAACTATGGTAATA
ATCCAGATCACGGGAATACTCCAAAGCCGAATAAGCCAAATGAAAACGGCGATTTGACATTGACCAAGACATGGG 
TTGATGCTACAGGTGCACCAATTCCGGCTGGAGCTGAAGCAACGTTCGATTTGGTTAATGCTCAGACTGGTAAAGT 
TGTACAAACTGTAACTTTGACAACAGACAAAAATACAGTTACTGTTAACGGATTGGATAAAAATACAGAATATAA 
ATTCGTTGAACGTAGTATAAAAGGGTATTCAGCAGATTATCAAGAAATCACTACAGCTGGAGAAATTGCTGTCAA 

GAACTGGAAAGACGAAAATCCAAAACCACTTGATCCAACAGAGCCAAAAGTTGTTACATATGGTAAAAAGTTTGT
CAAAGTTAATGATAAAGATAATCGTTTAGCTGGGGCAGAATTTGTAATTGCAA 

TTAGCACGTAAAGCAGATAAAGTGAGTCAAGAAGAGAAGCAGTTGGTTGTTACAACAAAGGATGCTTTAGATAGA 
GCAGTTGCTGCITATAACGCTCTTACTGCACAACAACAAACTCA 

GCTGCTTATAATGCTGCTGTGATTGCTGCCAACAATGCATTTGAATGGGTGGCAGATAAGGACAATGAAAATGTTG 
TGAAATTAGTTTCTGATGCACAAGGTCGCTTTGAAATTACAGGCCI IC'I 1 GCAGGTACATATTACTTAGAAGAAAC

AAAACAGCCTGCTGGTTATGCATTACTAACTAGCCGTCAGAAATTTGAAGTCACTGCAAC14'Cl"l ATTCAGCGACT 
GGACAAGGCATTGAGTATACTGCTGGTTCAGGTAAAGATGACGCTACAAAAGTAGTCAACAAAAAAATCACTATC 
CCACAAACGGGTGGTATTGGTACAATTATCTTTGCTGTAGCGGGGGCTGCGATTATGGGTATTGCAGTGTACGCAT 
^ ATGTT AAA AAC A AC AAAGATGAGGATCAACTTGCTT A A 

4191.4 

ATGACAATGCAGAAAATGCAGAAAATGATTAGTCGTAT C1 ICn 1 GTTATGGCTCTGTG F IT 1"! CTCTTGTATGGGG 

TGCACATGCAGTCCAAGCGCAAGAAGATCACACGTTGGTCTTGCAATTGGAGAACTATCAGGAGGTGGTTAGTCA 

ATTGCCATCTCGTGATGGTCATCGGTTGCAAGTATGGAAGTTGGATGATTCGTATTCCTATGATGATCGGGTGCAA 

ATTGTAAGAGACTTGCATTCGf GGGATGAGAATAAAC ITTCn CTT1 CAAAAAGACTTCGTTTGAGATGACCTTCC
TTGAGAATCAGATTGAAG TATCTCA TATTCCAAATGGTCTTTACTATGTTCGCTCTATTATCCAGACGGATGCGGT 
TTCl* I* ATCC AGCTG AA lUMClH'I^'rGAAATGACAGATCAAACGGTAGAGCCTTTGGTCATTGTAGCGAAAAAAACA 
GATACAATGACAACAAAGGTGAAGCTGATAAAGGTGGATCAAGACCACAATCGCTTGGAGGGTGTCGGCTTTAAA 
TTGGTATCAGTAGCAAGAGATGTTTCTGAAAAAGAGGTTCCCITGATTGGAGAATACCGTTACAGTTCTTCTGGTC 

AAGTAGGGAGAACTCTCTATACTGATAAAAATGGAGAGAr i riTGTGACAAATCTTCCTCTTGGGAACTATCGTTT
CAAGGAGGTGGAGCCACTGGCAGGCTATGCTGTTACGACGCTGGATACGGATGTCCAGCTGGTAGATCATCAGCT

GGTGACGATTACGGTTGTCAATCAGAAATTACCACGTGGCAATGTTGACTTTATGAAGGTGGATGGTCGGACCAA 

TACCTCTCTTCAAGGGGCAATGTTCAAAGTCATGAAAGAAGAAAGCGGACACTATACTCCTGTTCTTCAAAATGGT 

AAGGAAGTAGTTGTAACATCAGGGAAAGATGGTCGTTTCCGAGTGGAAGGTCTAGAGTATGGGACATACTATTTA 

TGGGAGCTCCAAGCTCCAACTGGTTATGTTCAATTAACATCG 

AGGAACTGGTAACAGTGGTTAAAAATAACAAGCGACCACGGATTGATGTGCCAGATACAGGGGAAGAAACCCTTG 
TATATCTTGATGCTTGTTGCCATTTTGTTGTTTGGTAG 

4191.5 

ATGAGCCACATATACTTATCTATTTTCACAAGTCTCTC^ 

AATATTTACGCATCGGTATGGAAGCAGCATATGCTCCCTTTAACTGGACCCAGGATGATGATAGCAACGGAGCTG 
TCAAAATCG ATGG GACCAATCAGTATGCCAACGGATACGATGTTCAAATCGCCAAGAAAATCGCTAAGGACTTAG 
GTAAAGAACCITTGGTTGTTAAA^CCAAGTGGGAAGGTCTAGTCC 

TATCGCAGGTATGAGTCCAACTGCAGAACGCAAACAAGAAATTGCCTTTTCGAGCAGTTACTATACTAGCGAACC
AGTTTTGCTTGTCAAAAA AGAT TCTGCCTACGCAAGTGCTAA^ 

TCTCAACAAGGGGTCTACCTTTATAACTTGATTGCACAAATCCCAGGTGCTAAAAAAGAAACAGCCATGGGAGAC 

TTCGCTCAAATGCGACAAGCTCTTGAGGCTGGTGTCATTGATGCTTATGTTTCTGAACGTCCAGAAGCACT 

CTGAAGCTGCGAACTCTAAGTTCAAGATGATTCAAGTAGAACCTGGTTTCAAAACTGGGGAAGAAGATA 

TCGCTATCGGGCTTCGTAAAAATGACAATCGTATTAGCCAAATCAATGCCAGCATTGAAACCATTTCAAAAGATG 

ACCAAGTTGCCTTGATGGATCGTATGATCAAGGAACAACCTGCCGAAGCTACAACAACTGAAGAGACTAGCAGTA

GTTTCTTTAGCCAAGTTGCTAAAATTCTTTCTGAAAACTGGCAACAACTCTTGC 

ATCTCTATCGTCGGAACCATCATAGGTCTCATTATTGGACTTGCCATTGGTGTCTTCCGTACTGCTCCTCTCTCT 

AAACAAAGTCATTTACGGCCTACAAAAACTAGTCGGCTGGGTTCTCAATGTCT 

CCAATGATTGTTCAATCGATGGTTATCTACTATGGAACTGCCCAAGCTTTCGGGATCAACC 

TTGACCGTACACTGGCTGCTATCTTCATCGTTTCAATCAATACCGGTGCCTACATGACTGAAATCGTCCGTGGTGG 
TATCCTAGCAGTTGACAAGGGACAATTTGAAGCTGCGACTGCTCTTGGTATGACCCATAACCAGACCATGCGTAA 
GATTGTCCTACCTCAGGTAGTCCGTAACATCCTACCTGCAACTGGTAATGAATTTGTCATCAATATCAAAGATACA 
TCTGTATTGAACGTTATCTCTGTTGTCGAA CTTTA TT^ 

CCAGACATTTACAATCATCGCCGTGATTTACITTGTCCTCACCTTCACCGTAACACGTATCCTACGCTTTATCGAGC 
GCAGAATGGACATGGATACCTACACTACAGGTGCTAACCAAATGCAAACGGAGGATTTGAAATAA 

4191.6 

ATGACACAAGCAATCCTTGAAATTAAACACCTCAAAA^TCCTATGGACAAAACGAAGTGCTAAAAGACA 

CTCACTGTCCACAAGGGAGAGGTCATCTCTATCATCGGAAGCTCTGGAAGCGGAAAATCGACCTTCCTACGCTCC 

ATTAACCTACTTGAAACACCAACTGATGGACAAATCCTTTATCATGGACAAAACGTCCTCGAAAAAGGCTATGAC 

CTCACGCAATACCGTGAAAAGTTGGGGATGGTTTTCCAATCCrrTAACCTCTTTGAAAATCTCAA 

ACACAATCGTCGCTCAGACAACTGTCCTAAAACGCGAACGCACAGAAGCTGAAAAGATTGCCAAAGAAAACCTG 

GAAAAGGTCGGCATGGGAGAACGCTACTGGCAAGCCAAACCAAAACAACTCTCAGGTGGTCAAAAACAACGTGT 

GGCCATCGCTCGTGCCCTCTCCATGAATCCGGACGCTATTCTCTTTGATGAACCAACATCAGCTCTCGATCCAGAA 

ATGGTTGGAGAAGTCCTCAAAATCATGCAGGACCTGGCTCAGGAAGGCTTGACCATGATTGTCGTAACCCATGAA 

ATGGAATTTGCCCGTGATGTCTCTCACCGTGTTATCTTTATGGATAAGGGCGTGATCGCTGAAGAAGGTAAACCAG 

AAGACCTCTTCACCAATCCTAAAGAAGACCGAACAAAAGAGTTCCITCAACGCTATCTCAAATAA 

4192.3 

ATGAAAAAGTATCAACTTCTATTCAAAATAAGTGCAGTCTTCT 

GA CGCTTA TCGTCCAAAACTATTGGCAATVlTCrrCTCAGATAGGCAATTTATTCTGGATTCAAAATATCTTGA 
T AC l lll 1 A TTGG AGTC ATG A TTGTGGTTCTTGTT AAG AC AGGCC ATGGTT ATCTCTTCCGC ATTCCAAG A A AAAA 
ATGGCTTTGGTATTCGATTTTGACAGTATTAGTGCrAGTGTTCCAGATCTCTTTTAACGTTCA 

TTCAGTCAACTGCGGAAGGTTGGGCTGTATTGATTGGTTATAGTGGGACTAACTTTGCAGAGCTAGGTATTTATAT

AGCCCTGTTCTTTCTGGTTCCACTGATGGAAGAATrGATTTATAGAGGATrACTGCAACATG Cr 

CGCGATTTGGTCTTGATTTGCTTCTTCCTTCTATTT^ 

ATATCTTCGTCTTTGCAACAGTTGGAATCATCTTTC 

GCGGTGCATGTGATCAATAATATTGTAGCGACCTTCCCGTTTTTGCTCACTTTTCTACATAGGGTCTTGGG 
4193.1 

ATGAACAAGAAACAATGGCTAGGTCTTGGCCTAGTTGCAGTGGCAGCAGTTGGACTTGCTGCATGTGGTAACCGC 
TCTTCTCGTAACGCAGCTTCATCnTCTGATGTGAAGACAuAAAGCAGCAATCGTCACTGATACTGGTGGTGTTGATG 
ACAAATCATTCAACCAATCAGCTTGGGAAGGTTTGCAGGCTTGGGGTAAAGAACACAATCTTTCAAAAGATAACG 
GTTTCACTTACTTCCAATCAACAACTGAAGCTGACTACCCT

AATCTTCGGTGTTGGTTTTGCCCTTAATAATGCAGTTAAAGATGCAGCAAAAGAACACACTGACTTGAACTATC 

TTGATTGATGATGTGATTAAAGACCAAAAGAATGTTGCGAGCGTAACTTTCGCTGATAATGAGTCAGGTTACcTTG 

CAG GTGTG GCTGCAGCAAAAACAACTAAGACAAAACAAGTTGGTTTTGTAGGTGGTATCGAATCTGA^ 

CTCGTTTTGAAGCAGGATTCAAGGCTGGTGTTGCGTCAGTAGACCCATCTATCAAAGTCCAAGTTGACTACGCTGG 

TTCATTTGGTGATGCGGCTAAAGGTAAAACAATTGCAGCCGCACAATACGCAGCCGGTGCAGATATTGTTTACCA 

AGTAGCTGGTGGTACAGGTGCAGGTGTCTTTGCAGAGGCAAAATCTCTCAACGAAAGCCGTCCTGAAAATGAAAA
AGTTTGGGTTATCGGTGTTGATCGTGACCAAGAAGCAGAAGGTAAATACACTTCTAAAGATGGCAAAGAATCAAA
CTTTGTTCITGTArCTACTTrGAAACAAGTTGGTACAACTGTAAAAGATATTTCTAAC 

TTCCCTGGCGGTCAAGTGATCGTTTACTCATTGAAGGATAAAGGGGTTGACTTGGCAGTAACAAACCTTTCAGAAG 

AAGGTAAAAAAGCTGTCGAAGATGCAAAAGCTAj^AATCCTTGATGGAAGCGTAAAAGT 

TCCTGAAAAATAA 

4193.3 

ATGTCTAAAAAATTACAACAAATTTCGGTTCCCTTGATTTCTGTATTCCTAGGAATTTTACTCGGAGCCATTGTCAT

GTGGATCrrCGGTTATGATGCrATTTGGGGCTACGAAGAATTGTTCTATACAGCCrrTGGCAGTCTGCGTGGGATT 

GGAGAAATCTTCCGTGCTATGGG TCCT CTGGTCTTGATTGGTCTTGG ri 1T GCCGTTGCCAGTCGACCTGGTTTCTT 

TAACGTCGGACTTCCTGGTCAGGCTTTGGCAGGTTGGATrTCTCAGTGGTTGGTTrGCCCrGTCGCATC 

CCCCGTCCCTTGATGATTCTAGCAACCATCGTGATTGCCTTGATTGCrGGTGGGATTGTCGGAGCGATTCCAGGTA 

TTCTTAGGGCCTATCTAGGGACGTCAGAGGTTATTGTAACCATCATGATGAACTACATTGTCTTGTATGTAGGGAA

TGCCITTATCCATGCTTTCCCTAAAGACTITCATGC^ 

TATCAGACACCTTGGTTGGCTGAGTTG ACTGGTAACTCACGGATGAATATTGGTA rTTTCTTI GCCATCATTGCCGT 

TGCAGTTATTTGGTTCATGCTCAAGAAAACAACTCTTGGTTTTGAAATCCGTGCAGTTGGTC^ 

CAGAATATGCTGGTATTTCTGCCAAGCGGACTATTATCCTATCTATGATTATTTCAGGTGCCTTGGCAGGTCTTGG 

GGAGCTGTTGAAGGTTTGGGAACCrTCCAGAACGTCTATGTTCAAGGTTCGTCATTAGCTATCGGATTTAACGGAA 

TGGCGGTTAGTTTGCTTGCGGCCAACTCACCAATTGGTATACTCTTTGCAGCCTTCCTATTTGGCGTTCTCC 

GGGCCTCCTGGTATGAATGCGGCG CAGG TACCATCTGAGCTTGTCAGCATTGTAACAGCGTCTATTATCriCrriG 

TCAGTGTTCATTACCTTATCGAACGCnTTGTCAj\ACCGAAAAAA^ 

4194.1

ATGGGAGTGAAAAAGAA^CTAAAGTTGACTAGTTTGCrAGGACTGTCTCTGTTAATCATGACAGCCTGTGCG 
AATGG GGT AACT AGCG ATATT AC AGCCG AATCGGCTG A 1 " T' ITT GG AGT A AATTGGTTT AC 1" I'C'iT l'GCGG AAATC A 
TTCGC TTTTT ATCGTTTGATATTAGTATCGGAGTGGGGATTATTCrcrrTTACGGTCTTGA 

CCAGTCTTTCAGGTGCA^ATGGTGG CTTCT AGGAAAATGCAGGAAGCTC AG CC ACGCATTAAGGCGCTTCG AG AA 

CAATATCCAGGTCGAGATATGGAAAGCAGAACCAAACTAGAGCAGGAAATGCGTAAAGTATTTAAAGAAATGGG 

TGTCAGACAGTCAGACTCTCTTTGGCCGATTTTGATTCAGATGCCGGTTATTITGGCCCTGTT 

GAGTTGACTTTTTA-AAGACAGGTCATTrCTTATGGATTAACCrrG 

TrAGCAGCAGTATTCACCTTTTTAAGTACTTGGTTGTCCAACAAAGCnTTGTCr 

CGATGATGTATGGGATTCCAGTCTTGAITI Tl ATCTTTGCAGTTTATGCGCCAGGTGGAGTCGCCCTATACTGGAC 
AGTGTCTAATGCTTATCAAGTCTTGCAAACCTATITC1TGAATAATCCATTC 

GTAGTACAGGCACAAAAAGATTTGGAAAATAGAAAAAGAAAAGCCAAGAAAAAGGCTCAGAj^^ 
4194.4

ATGGTTATCGATCCATTTGCTATCAACGAACTAGACTATTACITAGTTTCACACT 

CATACACAGCTGCAGCAATTCrCAJKTAATCCTAAGTTAGAGCATGTTAAGTTTATCGGTCCnTACCACT^ 

AATCTGGGAAGGATGGGGTGTTCCAAAAGAACGTATCATCGTTGTTAAACCAGGTGACACTATCGAATTAAAAGA 

TATGAAGATTCATGCAGTAGAATCATTTGACCGTACTTGCITGGTAACTCrCCCAGTGAACGGTGCTGATGAGACA 

GGCGGTGAACTTGCTGGCnTGGCTGTTACAGATGAAGAAATGGCTCAAAAGGCTGT^ 

CAGGTGGA^CCATCrATCATGGTGCAGATTCTCACTTCTCAAACTATITTGCAAAACATGGTA 

TGATGTTGCTTTGAATAACTATGGTGAAAATCCGGTAGGTATCCAAGACAAAATGACATCTATCGACCrTCITCGT 

ATGGCAGAAAATCTGCGTACCAAAGTCATTATCCCAGTTCACTATGATATCT^ 

ATGAGATTCTAGAACTTTGGAAAATGCGAAA^GATCGCITGCAATACGATrrCCATCCAT^ 

CGGTAAGTACACTTATCCrCAAGATCAACACTTAGTAGAATACCATCATCCACGTGGTTTTGATGATTGTTTTGAA 

CAAGACTCTAACATTCAATTTAAAGCTTTGCTATAA



4196.2 

ATG TTCCTT TCAGGC TGGT TGTCTAGTTTTGCTAATACnT ATATCCATG ATTTACT 1 ' 1 1 1 CCCAGATAG 

TCCATTTTTAAATGCCTTTGAAAGTGCTAT^ 

TTGTTTTGGCTTTGATTCCTGTGCGAAAATTAAAATCTTTGTTTTTA 

ATGATTAAGGATATTGGTTATATTCGTACGGATTrGCCAGAGGGCTTTGACTTTACTATTTC 
GTATCATCTCAGGAATTGCCTCTCACTGGACITTTTCAGGTCTAGCT 

T AT A AA GG AC A G AAGGTTGGCAAG A A AC AGGGCCTT A 1 ' 1 ' 11 ' I CT AGGTTT AGCCTTG GG AACTCACTTCTTGTTT A 
ACTCTCli ri iGTG GAGT TGGAAACAGAGrTGCCTTTAG 
TATCATGCHTATTGCTTTGTTGAGAAACACAATGAGTTGATGACCTAG 

4197.1 

ATG AAGGTGG AACC ACGTTGCG ACGTCCTTTCG AGG ATGTCGC A nTTTT 1' ATTAGG AT ACT AATTATGG AGTTGC 
AAGAATTAGTGGAGCGCAGTTGGGCAATCCGACAAGCTTATCACGAACTGGAAGTTAAGCATCATGATTCCAAGT 
GGACGGTAGAAGAAGACCTCITGGCTTTATCTAATGATATTGGAAATTTCCAACGACTGGTGATGACAAAGCAAG
GACGCTACTATGATGAAACACCCrACACACTGGAACAAAAACTTTCAGAAAATATCTGGTGGCTATTAGAACT^
CTCAACGTTTGGATATAGACATTCTGACGGAAATGGAAAACTTCCTCTCTGATAAAGAAAAGCAATTGAACGTTA 
GGACTTGGAAGTAG 

4197.4

ATGCTTGATTGGAAACAA'i'i'ri'l'l CTAGCCTATCTGCGCTCCCGTAGTCGTCTTTTTATCTATCTGCrrrCi'l'l GGC 
ATTTCTTGTCTTACTCTTTCAGTTTTTATTTGCCAGTCTAGGAAT^ 

GTAACCATATTATTTTTCACTTGGGACATATTGGTGGAAACGCAGGTCTATCGCCAGGAACTTCTCTATGGAGAGA 
GGGAAGCCAAGTCTCCTTTGGAAATAGCTTTAGCAGAAAAATTAGAAGCGCGTGAGATGGAACTCrATCAGCAGA 

GGTCAAAAGCAGAAAGAAAACTGACGGATTTGCTGGATTACTATACCTTGTGGGTCCATCAGATAAAGACCCCCA
TTGCAGCCAGTCAACTCTTAGTTGCAGAAGTGGTCGACCGCCAACTGAAGCAGCAGCTAGAACAGGAAATTTTCA 
AAATCGACTCCTATACCAACCTAGTTTTACAGTACCTGCGTTTAGAAAGTTTCCATGATGATTTGGTCTTAAAGC^ 
GGTTCAAATTG AGG ACTTGGTCAAGG AAATAATTCGTAAATATGCTC 1" 1 1 ' I'CTTTATTCAAAAAGGCTTAAATGTC 
AATCTACATGACCn'GATAAAGAAATCGTGACGGATAAAAAGTGGCrGCTAGTGGTTATTGAGCAAATCATCTCA 

AACAGTCTCAAGTACACCAAGGAAGGTGGTCTGGAGATTTATATGGATGACCAAGAGCrTTGTATCAAAGATACG
GGAATCGGGATAAAAAACAGTGATGTCCTCCGAGTATTTGAACGTGG C T fTV CAGGATACAATGGCCGTTTGACCC 
AGCAGTCCTCrGGACTTGGCCTTTATCTATCTAAGAJVAATTTCTGAAGAACTGGGGCACC^ 
TGAGGTCGGAAAAGGAACGACAGTGCGGATTCAGTTTGCTCAAGTGAACTTAGTCCTTGAGTAA 

4211.2

ATGGAACTTAATACACACAATGCTGAAATCTTGCTCAGTGCAGCTAATAAGTCCCACTATCCGCAGGATGAACTG 
CCAGAGATTGCCCTAGCAGGGCGTTCAAATGTTGGTAAATCCAGCTTTATCAACACTATGTTGAACCGTAAGAATC
TCGCCCGTACATCAGGAAAACCTGGTAAAACCCAGCTCCTGAACTTTTTl'AACATTGATGACAAGATGCGCTTTGT 
GGATGTGCCTGGTTATGGCTATGCrCGTGrrrCTAAAAAGGAACGTGAAAAGTGGGGGTGCATGATTGAGGAGTA 
CTTAACGACTCGGGAAAATCTCCGTGCGGTTGTCAGTCTAGTTGACCTTCGTCATGACCCGTCAGCAGATGATGTG
CAGATGTACGAATTTCTCAAGTATTATGAGATTCCAGTCATCATTGTGGCGACCAAGGCGGACAAGATTCCTCGTG
GTAAATGGAACAAGCATGAATCAGCAATCAAAAAGAAATTAAACTTTGACCCGAGTGACGATTTCATCCTct l l iC 
ATCTGTCAGTAAGGCAGGGATGGATGAGGCTTGGGATGCAATCTTAGAAAAATTGTGA 

4211.3

ATGACAAAGAAACAACTTCACTTGGTGATTGTGACAGGGATGAGTGGCGCAGGGAAAACTGTAGCCATTCAGTCC 
TTCG AGGATCTAGGTTATTTCACCATTGATAATATGCCGCCAGCTCTCTTGCCTAAG m r I'GCAGCTGGTTGAAAT 
TAAGGAAGACAATCCTAAGTTGGCCTTGGTAGTGGATATGCGTAGCCGTTLMllCirT'TCAGAGATTCAAGCTGTT 
TTGGATGAGTTGGAAAATCAAGATGGTTTGGATTTCAAAATCCTCrriUUGGATGCGGCTGATAAGGAATTGGTCG 

CTCGTTACAAGGAAATCAGACGGAGTCACCCACTAGCAGCAGACGGTCGTATTTTAGATGGAATCAAGTTGGAAC
GTGAACTCTTGGCACCTTTGAAAAATATGAGCCAAAATGTGGTGGATACGACTGAACTCACTCCA CGTG AGcTGCG 
CAAAACCCTTGCAGAGCAGTTrTCAGACCAAGAACAAGCCCAGTCITTCCGTATCGAAGTCATGTCTT^ 
AAGTATGGAATCCCGATTGATGCGGACTTGGTCTTTGATGTCCG rTTCT 1'GCCAAATCCCTATTATT TACCA GAACT 
GAGAAACCAAACGGGTGTGGATGAACCTGTTTATGATTATGTCATGAACCATCCTGAGTCAGAAGACTTTTATCAA 

CATTTATTGGCCTTGATTGAGCCGATTCTGCCAAGTTACCAAAAGGAAGGTAAGTCCGTTTTGACCATTGCCATC

GATGTACGGGTGGACAACACCGTAGTGTGGCATTTGCTAAACGCTTGGCGCAGGACTTATCCAAGAATTGGTCTGT 
TAATGAAGGGCATCGCGACAAAGACCGCAGAAAGGAAACGGTAAACCGTTCATGA 

4211.4 

ATGAGAAAACCAAAGATAACGGTGATTGGTGGAGGGACTGGAAGTCCCGTCATTCTAAAAAGTCTGCGGGAAAAA
GATGTGGAAATCGCAGCTATCGTGACGGTGGCAGATGATGGTGGTTCTTCAGGTGAACTCCGAAAAAATATGCAA 
CAGTTGACACCGCCAGGTGATCTTCGTAATGTCCTTGTGGCCATGTCGGATATGCCTAAGTTTTATGAGAAGGTCT
TTCAGTATCGGTTCTCTGAGGATGCCGGAGCCTTTGCTGGCCATCCATTGGGAAATCTCATCATTGCTGGCTTGTC 
AGAAATGCAGGGTTCAACCTATAATGCCATGCAGTTATTGAGCAAATTTTTCCATACAACAGGGAAAATTTATCCT 

TCCAGTGACCATCCTTTGACCCTTCATGCAGTCTTTCAGGATGGGACAGAAGTGGCTGGAGAGAGTCATATTGTAG
ACCATCGAGGCATAATTGACAATGTCTATGTGACCAATGCCCTAAACGATGATACGCCTCTGGCCAGCCGTCGAG 
TAGTGCAGACCATCCTTGAuAAGTGACATGATTGTCCTAGGGCCAGGTTCCCTCTTTACCTCTATTTTGCCCAAT^ 
CGTG ATT AAGG A AATTGGGCGGGCTC fTT i'GG A AACC AAGGC AG AAATTGCCT ATGTCTGC AAT ATC A TG ACC C A 
ACGTGGGGAGACGGAACACTTTACAGATAGCGACCACGTGGAAGTCTTGCATCGTCACCHTGGTCGCCCTTTTATC 

GACACTGTCrrrGGTGAATATTGAAjVAAGTGCCTCAGGAATACATGAATTCCAACCGTTTTGATGAATACrTAGTGC
AAGTGGAACACGATTTTGTAGGTCTTTGTAAGCAAGTTTCGCGCGTGATTTCATCTAACTTCCTTCGTCTGGAAAA
TGGCGGTGCCTTCCACGATGGAGATTTGATTGTGGACGAGTTGATGCGCATTATACAGGTGAAAAAATGA 



tJ * 1 

ATGAAAAATTTGATAAAGTTGCTAATAATTAGATTGATTGTTAACrTAGCAGACAGTGTATTTTATATAGTAGCAT
TGTGGCACGTTAGCAATAATTATTCTTCGAGCATGTTCTTAGGAATATTTATTGCAGTAAATTATCT 
TTACTAATC IT IT ' nGGACCAGTTATTGACAGAGTAAATCCGCAAAAAATTCTTATAATATCAATTTTGGTTCAATT 
AGCAGTGGCTGTA AT A IT 1 I ' I ATT ATT ATTAAACC AAATATCATTTTGGGTG AT AATGAGTCTAGTGTTT ATTTC AG 
TAATGGCTAGCTCCATAAGTTACGTGATAGAAGATGTGTTGATTCCTCAAGTGGTAGAATATGATAAGATTGTATT

TGCAAATTCT Cn ' ITl AGTATTrCGTATAAAG
CAGTAGGATTTATTTTATTGGTTAAGATAGATATAGGCATA f V 1 1 I ACTTG CTCT ATTT AT ATTGTTGTTGTTAAAA

TTTAGAACTAGCAATGCGAATATAGAAAACTTCTCTTTCAAATATTACAAGAGAGAAGTGTTGCAA 

TTTATTTTAAATAATAAATTATTATTTAAAACCAGTATTTCTTT 

GTAGTTGTACCGATTTTTTCTATTCGATATTTTGATGGTCCGA 1 1111 1 ATGGTA l ' l 1 VI 1 1 AACTATTG CTGGTTTG 
GGTGGTATATTGGGAAATATGCTAGCGCCAATCGTAATAAAATATTTAAAATCGAATCAAATTGTTGGTGTATTTC
TTTTTTTGAACGGCTC 

GTTTTATGTCTAAAGGAGTCTTCAATATTATTTTTAATTCGTTGTACCAACAAATACCTCCACATCAACTT 
AGGGTAAATACTACCATTGATTCTATTATTTCTTTTGGAATGCCAATTGGTAG 

TTTGAATATTGAATTAGTGTTAATTGCTATTAGCATACCTTA ITTTTTG rTTTCT T ATA TTTTTT ATACGGATAATGG 
ATTGAAAGAATTTAGTATATATTAG 

4213.2 

ATGATGTCTAACAAAAATAAGGAAATTCTGATTTTTGCGATTCTCTATACAGTCCTCT 
ATTGCTGGCTTCTTTAATGCCATCTGCCATTGCAAATTATCTTGTTTATC 

GTTCAAGGATAGATTGATCCAACAATGGAAGGAGATTAGAAAGACTAAAAGAAAATTCITCT^ 

AGGATGGCrCTTTCTCATTCTGATGACTGTTGTCTTTGAATTTGTATCAGAGATGTTGAAGCAC 

ATGGACAAGGTCTAAATCAGTCTAATATTCAAAGTACCTTTCAAGAACAACCACTACTGATAGCTGTTTT^ 

TGTCATTGGACCTCTGGTAGAAGAAT TATITT TCCGTCAGGTCTTATTGCATTACTTGCAGGAACGGTTGTCAGGTT 

TACTAAGCATTATTCTGGTAGGACTTGTT7TTGCTCTGACT 

GCAGTTGGTTACTTAGGTGGAGGCCTTGCCTTTTCTATTATTTATGTC 

TACrTGTTCACATGTTAAGCAACAGCCTCTCCTTAATCATTTTAGCTATCAGTATAGTAAAATGA 



4224.1

TTGAAAAAGCCAATTATCGAATTCAAAAACGTCTCTAAAGTTTTTGAAGACAGCAACACCAAGGTTCTCAAAGAC 

ATCAACTTTGAGTTGGAAGAAGGGAAATTCTACACCCTTCTAGGTGCATCTGGTTCGGGGAAATCAACTATCCTAA 

ACATTATTGCAGGTTTACTGGATGCGACGACAGGAGATATCATGCTAGACGGTGTTCGTATCAATGATATTCCAAC 

CAACAAGCGCGACGTACATACCGTCTTCCAATCCTATGCCTTGTTCCCACATATGAATGTGTTTGAAAATGTTGCC 

TTTCCACTTCGCTTGCGTAAAATTGATAAGAAAGAAATCGAGCAGCGTGTAGCGGAAGTrCTCAAGATGGTTCAGT 

TGGAAGGTTATGAAAAACGTTCCATCCGCAAACTTTCTGGAGGACAACGTCAGCGTGTGGCCATCGCCCGTGCTA 

TCATCAACCAACCCCGTGTGGTCTTGTTGGACGAGCCTTTATCAGCGCTGGAC1TGAAATTGAGAACAGACATGCA 

GTACGAATTGCGTGAATTACAACAACGATTGGGCATTACCTTTGTCTTTGTCACTCACGATCAGGAAGAAGCTCTT

GCCATGAGTGACTGGATTTTCGTTATGAATGATGGCGAGATTGTCCAGTCTGGAACCCCTGTGGACATCTACGATG

AGCCAATCAACCACTTTGTTGCCACCTTTATCGGGGAGTCAAACATCTTGCCAGGTACCATGATTGAGGACTACTT

GGTCGAATTTAACGGCAAACGCTTTGAAGCGGTTGATGGTGGGATGAAGCCAAATGAACCTGTTGAGGTCGTTAT 

TCGTCCAGAGGACTTGCGCATTACCCITCCTGAAGAAGGCAAGCTCCAAGTTAAGGTCGATACCCAGCTTTTCCGT 

GGAGTrCATTATGAAATTATCG CCTAT GACGAACTTGGAAATGAATGGATGATCCACTCAACCCGTAAGGCTATCG 

TGGGTGAGGAAATCGGTCTGGACTTTGAACCAGAAGACATCCACATCATGCGTCTCAATGAAACCGAAGAAGAGT 

TCGATGCTCGTATTGAGGAGTACGTAGAAATCGAAGAGCAAGAAGCAGGTTTGATCAATGCAATCGAGGAGGAAA 

GAGATGAAGAAAACAAGCTCTAA 

4252.1 

ATGAAATCAATGAGAATCTTATTTTTGT^ 

GCATCTTGTCCTTTAAACAAAGTACAGCTTTTTTCATCGGAAGCATC 

AAATTATCTTTATACCCGTAAGCAAGAAGTCCATAGTGTCCTAGCCAGTAAGAAGTCGGTGAAGCTTTTTTACAGT 

ATGTTACTCTTAATTAATTTGTTAGGAGCTGTTCTTGTTTTGTCAGATAACTTGTTCA

AGAATTAGTTGACnTrTATTGCCATCCTTCTTTTTC 

ATACGTGCGCGAl'l'l'l'Cri'GCTATGCTGGACAGAAAAAAGACAGTGTTGGTGACTATTTTAGCAACA Cl ' l C r 
TTAAGAAATCCAATGACCAT TGTCTCACTT CTGATTTATATTGGACTGGGCTTGTTTm 
AAATTCGGTTAAGAAGGAAGTTTCCTTTTATGGTCATATTrrCCGAGATCTTGTATTGGTCATTGTTACGCT 
TCTTTTAG 

4252.2 

ATGGTTAAAAAAATTATTGGAATGGTGCTAGCTTTACTTTCTC 

TATTTATCAACAAGGGACAGAAACCTTAGCTAAAACCTATAAAAAAATCGGTGAAGAAACCAAGGTTATTGAAGC 

GACTGAACCTCTAACCATTCTGTTAATGGGAGTGGACACCGGAAATGTTGAACGAACTGAAACTTGGGTCGGTAG 

AAGTGATAGCATGATCTTGATGACAGTGAATCCTAAAACGAAAAAAACAACAATGATGAGTTTAGAGCGGGATAT 

TCTGACGCGCATTGAATCAGGGAATGGTCAGGCtCATGAAGCGAAACTGAACTCAGCATATGCAGATGGTGGAGC 

AGAGCTTGCTATAGAAACCATTCAAAAAATGATGAATATCCATATTGATCGCTATGTGATGGTCAATATGAGAGG 

ATTGCAAAAACTAGTGGATGCAGTAGGAGGTATTACAGTCAATAATATCCTAGGTTTCCCAATTTCTATCAGTGAC 

CAAGAAGAATTTAATACTATTTCTATCGGTGTTGGGGAGCAACATATTGGGGGAGAAGAAGCCCTAGTCTATGCA 

CGAATGCGTTACCAAGATCCTGAGGGGGATTATGGTCGTCAAAAACGTCAACGTGAAGTTATTCAAAAAGTCATG 

GAAAAAGCTCTCAGTTTAAATAGCATTGGTCATTATCAAGAGATTCTAAAAGCTTTGAGTGACAATATGCAGACC
AATATTGATTTGTCTGCAAAAAGTATCCCTAACTTGCT

AGTTGCAGGGTGAAGGAGAGATACTTCAAGGTGTTTCTTACCAGATTGTTTCGAGAGCACATATGTTGGAAATGCA 

AAATCTACTCCGACGTTCTTTGGGACAAGAAGAAGTTACTCAGCTTGAAACCAATGCGGTTTTATTTG 

TTTGGCAGAGCACCTGTTGGTGATGAAGATAATTAA 

4256.2 

ATG AAA AAACAAGCCTATGTCATTATTGCTCTCACCTCCTTCCTATTTGTCl" T V IT 1'rTCTCCCACAGCTTGCTGG A 

AATACTTGATTTTGACTGGTCTATCTTTTrGCACGATGTCGAAAAAACAGAAAAATTTGTC^ 

TCAGCATGTCCATGACCTGTCTCTTAGCCCTGTTTTGGCGAGGGATCGAAGAGCrrTCrCTAAGAAAAATGCAGGC 

TAATCTCAAGCGTTTATT AGCAG GGCAAGAAGTGGTTCAGGrrGCAGATCCAGATTTGGATGCCAGTTTCAAGTCC 

TTATCAGGTAAACTTAACCTTTTGACAGAGGCTCTTCAAAAAGCTGA-AAATCAGAGCCTTGCTCAGGAAGAGGAA 

ATCATCGAGAAGGAACGGAAGCGAATTGCTCGGGATTTGCACGATACAGTCAGTCAGGAGTTGTTTGCGGCCCAC 

ATGATTTTATCGGGTATCAGTCAGCAGGCTTTGAAATTGGATAGAGAAAAGATGCAGACCCAGTTGCAGAGTGTC 

ACAGCTATTTTAGAAACAGCCCAGAAGGATTTGCGGGTTTTGCTCITGCATT^ 

AGAGCITGATAGAAGGGATTCAAATTCTTTTAAAAGAGCrrGAGGACAAGAGTGATCTTAGGGTTAGTCTCAAGC 

AGAATATGACGAAATTGCCTAAGAAAATCGAGGAGCATATCTTCCGTATCCTGCAAGAGTTGATTAGCAATACCC 

TCCGCCATGCCCAGGCATCTTGCCTAGATGTCTACCTCTATCAGACAGATGTTGAATTGCAACTGAAGGTGGTGGA 

CAATGGGATTGGTTrCCAGTTAGGGAGCTTAGACGACTTGAGTTATGGACTGCGAAATATCAAGGAGCGGGTTGA 

AGATATGGCTGGAACAGTTCAACTCTTGACAGCTCCCAAGCAAGGGCTGGCGGTTGATATCCGTATTCCCCTGTTA 

GATAAGGAATGA 

4263.1 

ATGATTGTTTCCATTATTrCTCAAGGATTTGTCTGGGCTATTCTAGGTCTGGGAATCTTTATGACAT^ 
AAACTITCCAGATATGACGACAGAAGGTTCCITCCCrcnTGGGGGAGCTGTTGCT 

GTGAACCCA ITTT I AG CG AC ACTTGTTGCTGTAGG AG CAGGTTGTTTGGCTG G AATGGC AG C AG GCC tTCTTI ATA 
CAAAAGGGAAGATCCCAACCn*GCTCTCAGGGATTTTGGTGATGACrTCrTGTCACTCAATCATGCrCTTGATTAT 
GGGACGTGCGAATTTAGGCCTGCTTGGAACCAAGCAAATTCAGGATGTTTTGCCrTTTGATTC 
CTCTTGACAGGTCTCATCrTTGTGAGTATTGTTATTGCTCTCATGCTCTTTTT 

CTATATTGCTACAGGGGATAATCCTGATATGGCTAGAAGTTTCGGGATTCATACTGGACGCATGGAGCTCATGGGC

TTGGTCTTATCAAATGGTGTGATTGCCCTTGCAGGTGCCCTCATTGCTCAGCAAGAAGGTTATGCCGATGTGTCTC 

GAGGGATCGGGGTTATCGTTGTGGGGCTTGCAAGTTTGATTATTGGAGAAGTTATTTTCAAGAGTTTGAGC^ 

AGAGCGTTTGGTTACTATCGTTGTAGGTTCTATCGCTTATCAATTTTTAGTGTGGGCAGTTATCGCAC^ 

ATACAAGTTACCTTCGTTTATACAGTGCCTTGATTTTAGCAGTCTGCCTCATGATTCCAACATTTAAGCAAACAAT

CTTGAAAGGAGCCAAGTTAAGCAAATGA 

4346.1 

ATGAAAAAAATGA^AGTTTGGTCTACTGTACITGCAAC^ 

GTrCAAATTCTACGACTGCTTCTTCATCTGAAGAAAAAGCTGATAAAAGTCAAGAAT^ 

AGTCrCAAATGGTCGTGGTGATTGGTrAACTGCTAAAGCAAAAGAAGCTGGTTTTAATATAAAAATGGT^ 

GCTGGCGCrCAATTAGCAGACCGTGTTATTGCTGAGAAGAATAATGCAGTTGCAGATATGGTATTTGGAATTGGTG 

CTGTTGATTCAAATAAAATTAGAGATCAAAAATTACTAGTACAGTACAAGCCrAAATGGTTAGATAAAATTGATC 

AATCITTATCAGATAAAGATAATTATTATAATCCTGTGATTGTTCAACCATTAGTTTT 

AAAGAAATGCCTAAAGATTGGACTGAArrAGGTAGTAAGTATAAAGGTAAATATTCAATTrcrGGTC 

GGTACAGGACGGGCAATTCTAGCAAGTATCTTAGTTCGATACCTTGATGATAAAGGTGAATTAGGTGTTTCCGAAA

AAGGTTGGGAAGTAGCAA^AGAATATTTGAAAAATGCATACACTCITCAAAAGGGAG 

TGTTAGACAAj\GAAGATCCAATACAATATGGAj\TGATGTGGGGTTCTGGTGCATTAGTTGGACAAAAAGAACAAA 

ATGTTGTTTTCAAAGTTATGACTCCTGAGATTGGTGTACCATITGTAACTGAACAAACTATGGT^ 

TAAAAAACAAGCGTTAGCTAAAGAATTTATTGATTGGTTTC 

CTrrGGATCTATTCCTGCAAATAAAGATGCCCrCAAAGATCTACCTGAAGATACGAAGAAj\TT^ 

AAACCACAAAATATTGACTGGGAAGCTGTTGGAAAGCATrrGGATGAATGGGTAGAAAAAGCrGAArrAG 

GTACAATAA 

4346.2 

ATGATTAAATTrGATAATATTCAAATTAAATATGGTGATTTTGTTGCAATTGATAATCTGAATTTAGATATACATG 
AAGGGGAATTTrrTACATTTCn-GGGCCTTCAGGATGTGGTAAATCAACT 

AGATCCATCATCAGGAAGTATTGAAGTTAATGGAACAGATGTCACrCATTTGGAACCTGAAAAGCGTGGAATTGG 

T ATTGT ATTTC AATCTT ATGCG CT ATTTCC AACT ATG ACTG I " I " I ' l'l G AT AAT ATTGCATTTGGTTTAAAAGTT AAG A 

AGGTAGCTCCAGATGTTATTAAAGCrAAAGTATCAGCAGTGGCAGCAAAAATTAAGATCTCTGATCAACAGTTAC 

AGCGTAATGTATCAGAATTATCTGGGGGTCAACAACAAAGGGTAGCATTGGCrCGTGCTCTGGTTCTTGAACCTAA 

AATTCITTGTCrAGATGAACCATTGTCAAACCTTGACGCAAAATTACGTGTAGATTTGAGAAAAGAGTTG 

CnTCAAAAAGAGTTAGGTATTACTACTTTATATGTTACrCATGATCAAGAGGAAGCCTTGACTTrATCTG 

TTGCAGTCTTTAACAATGGATACATCGAACAGGTCGGTACACCAGTAGAGAT^ 

TGTATGTGATTTTATTGGAGATATTAATGTTTTGACCGATGAAACAGTCCACGAAGTATTATTGAAAAATACAAGC 

GTTTTCTTAGAGGATAAAAAAGGATACATTCGATTAGAGAAAGTTCGATTCAATCGTGAAA 

TTCTAAAAGGGACAATTATTGATGTTGAGTTTTCTGGAGTTACAATTCACTATACAj\TAAAA^
GATrCTTAATGTAACAAGTATTGATAGTCAGGCTGCTATTAGATCTGTCGGAGAAAGTGTGGAATTATTTATCACA
CCATCAGACGTTCTGCAATTTTAA 

4346.3 

ATGCGTCATAAATTAAATTTAAAAGATTGGCTTATTCG 

TCCAj\ACTTTGATCTAGTAGTGAATGTATTTGTAAAAGGAGGAGAATTTTCCCTTGA 
AAATCTCAGAGGGCACTTCAGAGTATTATGAACAGTTTTAAGTTAGCATTTTCACTCA^ 
TAGGTATTCTTTGTGTTCTATTTACAGAGTACTTTGATATTAAAGGTC 
TCTTTAATTTATGGAGGAGTGGTTTTAGCGACTGGATA 

TTTACAAAATGTTATCCCTTCTTTAGACCCTAACTGGTTTATTGGGTATGGTGCAGTCT^ 

CAGGAACTGCTAATCATACATTGTTTTTAACAAATACAATTCGAAGCGTTGACTATCACACTATTGAGGCTGCTCG 

AAATATGGGAGCAAAACCATTTACTGTTTTCCGAAAAGTAGTGTTACCAACCTTAATTCCAACTCTATT^ 

ACTATTATGGlTn ICl'l AGTGGTTTATCTGCAGTAGCAGCACCCATGATTGTTGGTGGTAAAGAATTTCAAACTAT 

AAATCCAATGATTATTACATTTGCAGGGATGGGGAATTCTCGTGATTTAGCTGCCCTACTTGCAATTATTTT 

ATTGCAACTACAATTTTGCTTACTATCATGAATAAGATAGAAAAAGGTGGAAATTATATT^ 

AAGCGCCrCTTAAAAAACAAAAAATTGCGTCTAAGCCnTGGAATATC 

CACAGTTTTCATGCTTCCACTAA III 11 ATAGTATTATACTCATTTACAGATCCAGTTGCAATTCAAACAGGTAACT 
TAACATTATCAAACTTTACITTAGAAAATTATCGCTTATTCTTTAOT 

AGCTTTATTTATTCTATTATTGCTGCGACAACAGCAACAATTCTCGCAGTTGTATTTGCTCGTGTTG 
TAAATCTCGTTTTGA'I I'LC I L ATTTGAATATGGTGCTCTACTTCCTTGGTTACTACCAAGTACACTTTTAGCAGTAA 
GTTTATTATTTACTTTTAATCAGCCACAATTTCTTGTCTTGAATCAGATTTTGGTAGGTAGTTTGGTAATTCTACTT
ATTGCATATATAGTTGTAAAAATCCCATTTTCTTATAGAATGGTACGTGCTATTTTATTTAGTGTTGATGATGAGAT 
GGAAGATGCAGCAAGAAGTATGGGTGCTTCACCll H 1 ATACTATGATGAAGGTTATCATTCCATTTATTTTACCG 
GTTGTTCTCTCTGTTATTGCTTTAAACTTTAACTCT

CTAGCTCAACCATTAGGTATTACGATTCGATCTGCAGGTGATGAAACAGCAACATCTAATGCACAAGCTCTGGTAT 
TTGTTTATACAATTGTTCTGATGATTATTTCTGGAACGGTATTATACnTCACACAAAGACCGGGGCGTAAAGTAAG 
GAAATAA

Table 2

MEELVTLDCLFIDRTKIEANANKYSFVWKKTTEKFSAKLQEQIQVYFQEEITPLLIKYAMFDKJCQKRGYKESAKNLANW 
HYNDKEDSYTHPDGWYYRFHHTKYQKTQTDFQQEIKVYYADEPESAPQKGLYMNERYQNLKAKECQALLSPQGHQIF 
AQRKIDVEPVFGQIKASLGYKRCNLRGKRQVRIDMGLVLMANNLLfCYSKMKZ

MGKGHWNRKRVYSIRKFAVGACSVMIGTCAVLLGGNIAGESVVYADETLITHTAEKPICEEKMrVEEKADKALETKNIV 

ERTEQSEPSSTEAiASEKKEDEAVTPKEEKVSAKPEEKAPRlESQASNQEKPLKEDAKAVTIsrEEVNQMIEDRKVDFNQN 

WYFKLNANSKEAIKPDADVSTWKKLDLPYDWSIFNDFDHESPAQNEGGQLNGGEAWYRKTFKXDEKDLKKMVRLTF 

DGVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLQKDGRENVIAVHAVNKQPSSRWYSGSGIYRDVTLQVTDiCVHV
EKNGTTILTPKLEEQQHGKVETHVTSKIVNTDOKDHELVAEYQIVERGGHAVTGLVRTASRTLFCAHESTSLDAILEVER 
PKLWTVLNDKPALYELITRVYRDGQLVDAKKDLFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEENYICAE 
YRfU.KQMKEMGVNSIRTTHNPASEQTLQLAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEKWSDFD 
LRTMVERGKNNPAIFMWSIGNEIGEANGDAHSLATVKRLVKVIKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDA 

VGFNYSEDNYKALRAKHPKWLIYGSETSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFD
RDKAGYAGQFIWTGTDYIGEPTPWHNQNQTPVKSSYFGIVDTAGIPKHDFYLYQSQWVSVTCKKPMVHLLPHWNWENK 
ELASKVADSEGKIPVRAYSNASSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQPGTLEAIARDES 
GKEIARDKITTAGKPAAVRL1KEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRFQLHGQGQLVGVDNGEQASRER 
YKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFTGICKEGQEKTVLGTEVPKVQTIIGEAPEMPT 

TVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREVEARVEVIALKSELPVVKRIAPNTDLNSVDKSVSYVLIDGS
VEEYEVDICWEIAEEDKAKLAiPGSRiQATGYLEGQPIHATLVVEEGNPAAPAVPTVTVGGEAVTGLTSQKPMQYRTLA 
YGAKLPEVTASAKNAAVTVLQASAANGMRAStFIQPKDGGPLQTYAIQFLEEAPKlAHLSLQVEKADSLKEDQTVKLSV 
RAHYQDGTQAVLPADKVTFSTSGEGEVAIRICGMLELHKPGAVTLNAEYEGAICDQVELTlQANTEiaClAQSIRPVNVVT 
DLHQEPSLPATVTVEYDKGFPfCTHKVTWQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTPIAEAP 

QLPESVRTYDSNGHVSSAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANISDQWTGSELPLAF
ASDSNPSDPVSNVNDKLISYNNQPANRWTNWNRTNPEASVGVLFGDSGILSKR5VDNLSVGFHEDHGVGVPKSYVIEY 
YVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHFSFDKVETYAVRIRMVKADNKRGTSITEV 
QIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGKVPAVTASVSNNGLATVVPSVREGEPVRVLAKAENGD 
ILGEYRLHFTKDKSLI^HKPVAAVKQARLLQVGQALELPTKVPVYFTGiCDGYETKDLTVEWEEVPAENLTKAGQFTVR 

GRVLGSNLVAEITVRVTDKLGETLSDNPNYDENSNQAFASATNDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNP
EVSAGVIFRENGKIVERTVTQGKVQFFADSGTDAPSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYR 
ADKDIAAGDEINVTFKA1KAKAMRWRMERXADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQIT 
YKGQRPKVSVEENNQVASTVVDSGEDSFPVLVRLVSESGKQVKEYRIHLTKEKPVSEICTVAAVQEDLPKIEFVEKDLAY 
KTVEKKDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREVVEVPTDRIVLVGTKPVAQEAKKPQVSEKADTKPID 

SSEASQTNKAQLPSTGSAASQAAVAAGLTLLGLSAGLVVTKGKKEDZ

MKIMfCKKYWTLAILFFCLFNNSVTAQEIP!CNLDGNITHTQTSE5FSESDEKQVDYSNKNQEEVDQNKFRJQIDKTELFVT 
TDKHLEKNCCICLELEPQINNDlVNSESNNLLGEDKLDNKIKENVSHLDNRGGNlEHDKDNLESSIVRKYEWDtDKVTGG 
GESYFCLYSKSNSKVSIAILDSGVDLQNTGLLKNL5NHSKNYVPNKGYLGKEEGEEGIISDIQDRLGHGTAVVAQIVGDDN 
INGVNPHVN1NVYRIFGKSSASPDWIVKAIFDAVDDGNDIINLSTGQYLMIDGEYEDGTNDFETFLKYKKA1DYANQKGV
IIVAALGNDSLNVSNQSDLLKLISSRXKVRKPGLVVDVPSYFSSTISVGGIDRLGNLSDFSNKGDSDAIYAPAGSTLSLSEL 
GLNNFINAEKYKEDWIFSATLGGYTYLYGNSFAAPKVSGAIAMIIDKYKLKDQPYNYMFVKKFWKKHYQZ 

MKKTWKVFLTLVTALVAVVLVACGQGTASJCDNKEAELKKVDFILDWTPNTNHTGLYVAICEKGYFKEAGVDVDLKLP 
PEESSSDLVINGKAPFAVYFQDYMAiCKLEKGAGITAVAAlVEHNTSGtlSRiCSDNVSSPKDLVGKKYGTWNDPTELAML
KTLVESQGGDFEKVEKVPNNDSNSITPIANGVFDTAWIYYGWDGILAKSQGVDANFMYLKDYVKEFDYYSPVIIANND 
YLKDNKEEARKVIQAIKKGYQYAMEHPEEAADILIKNAPELKEKRDFVIESQKYLSKEYASDKEKWGQFDAARWNAFY 
KWDKENGILKEDLTDKGFTNEFVfCZ 

MKRTWRNSFVTNLNTPFMlGNIEIPNRTVLAPMAGVTNSAFRTrAKELGAGLVVMEMVSDKGIQYNNEKTLHMLHIDE
GENPVSIQLFGSDEDSLARAAEFIQENTKTDIVDINMGCPVNKIVKNEAGAMWLKDPDKIYSIINKVQSVLDIPLTVKMR 
TGWADPSLAVENALAAEAAGVSALAMHGRTREQMYTGHADLETLYKVAQALTKIPFIANGD1RTVQEAKQRIEEVGA 
DAVMIGRAAMGNPYLFNQINHYFETGEILPDLTFEDKMKIAYEHLKRLINLKGENVAVREFRGLAPHYLRGTSGAAKL 
RGAISQASTLAEIETLLQLEKAZ 

MIKNPKLLTKSFLR5FAILGGVGLVIHIAIYLTFPFYYIQLEGEKFNESARVFTEYLKTKTSDEIPSLLQSYSKSLTISAHLK 
RDIVDKRLPLVHDLDIKDGKLSNYIVMLDMSVSTADGKQVTVQFVHGVDVYKEAKNILLLYLPYTFLVTIAFSFVFSYF 
YTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELESRNEQrVKLQNQKVSFVRGAS 
HELKTPLASLRIILENMQHNIGDYKDHPKYIAKSINKIDQMSHLLEEVLESSKFQEWTECRETTVKPVLVD1LSRYQELAH 
SIGVTIENQLTDATRVVMSLRALD^CVLTNHSNAIKYSDICNGRVIISEQDGYLSIK^^^CAPL5DQELEHLFDIFYHSQIVTD
KDESSGLGLYtVNNILESYQMDYSFLPYEHGMEFKiSLZ 

MYLGDLMEKAECGQFSII^FLLQESQTTVKAVMEETGFSKATLTKYVTLLNDKALDSGLELAIHSEDENLRLSIGAATK 
GRDJRSLFLESAVKYQILVYLLYHQQFLAHQLAQELVISEATLGRHLAGLNQILSEFDLSIQNGRWRGPEHQIHYFYFCL
FRKVWSSQEWEGHMQKPERKQEIANLEEICGASLSAGQKLDLVLWAHISQQRLRVNACQFQVIEEKMRGYFDNIFYLR

LLRKVPSFFAGQHIPLGVEDGEMMIFFSFLLSHR1LPLHTMEYILGFGGQLADLLTQLIQEMKKEELLGDYTEDHVTYEL 

SQLCAQVYLYKGYILQDRYKYQLENRHPYLLMEHDFKETAEEIFHALPAFQQGTDLDKKILWEWLQLIEYMAENGGQ 

HMRIGLDLTSGFLVFSRMAAILKRYLEYNRFrriEAYDPSRHYDLLVTNNPIHKKEQTPVYYLKNDLDMEDLVAIRQLLF 

TZ 

MEFSKKTRELSIKKMQERTLDLLIIGGGITGAGVALQAAASGLETGLIEMQDFAEGTSSRSTKLVHGGLRYLKQFDVEV 

VSDTVSERAVVQQIAPHIPKSDPMLLPVYDEDGATFSLFRLKVAMDLYDLLAGVSNTPAANKVLSKDQVLERQPNLKK 

EGLVGGGVYLDFRNNDARLVIENIKRANQDGAL1ANHVKAEGFLFDESGKJTGVVARDLLTDQVFEIKARLVINTTGPW 

SDKVRNI^NKGTQFSQMRPTKGVHLVVDSSKIKVSQPVYFDTGLGDGRMVFVLPRENKTYFGTTDTDYTGDLEHPKVT 

QEDVDYLLGIVNNRFPESNITIDDIESSWAGLRPLIAGNSASDYNGGNNGTISDESFDNLIATVESYL5KEKTREDVESAV 

SKLESSTSEKHLDPSAVSRGSSLDRDDNGLLTLAGGKITDYRKMAEGAMERVVDILKAEFDRSFKLINSICTYPVSGGELN 

PANVDSEIEAFAQLGVSRGLDSKEAHYLANLYGSNAPKVFALAHSLEQAPGLSLADTLSLHYAMRNELTLSPVDFLLRR 

TNHMLFMRDSLDSIVEPILDEMGRFYDWTEEEKATYRADVEAALANNDLAELKNZ 

MMNELFGEFLGTLILILLGNGVVAGVVLPKTKSNSSGW'IVITMGWGIAVAVAVFVSGKLSPAYLNPAVTIGVALKGGLP 

WASVLPYILAQFAGAMLGQILVWLQFKPHYEAEENAGNILATFSTGPAIKDTVSNLISEILGTFVLVLTIFALGLYDFQA 

GIGTFAVGTLIVGIGLSLGGTTGYALNPARDLGPRIMHSILPIPNKGDGDWSYAWIPVVGPVIGAALAVLVFSLFZ 

MTKKKIER1SVIHREKILWLKWYFMRDKEQPKYSVLERXMFDAAKNQDMLAYQKYATIKQITDIRVQTSEADILEAVKE 
VYVYNHMNVIGACQRILFISQSPAYDKLNKWFNIYSDLYFSVVPLPKMGVYHEMVGIZ 

MKNSNEAEMKLLYTD[RTSLTEILTREAEELVAAGKRVFYIAPNSL5FEKERAVLEYL5QQASFS[TVTRFAQMARYLVL 

NDLPAKTTLDDIGLGLAFYKCLAELDPKDLRVYGAIKQDPQLIQQLIELYHEMTKSQMSFLDLENLTDEDKRADLLLIF 

EKVTAYLNQGQLAQESQLSHLIEAIENDKVSSDFNQIALVIDGFTRFSAEEERVVDLLHGKGVEIVIGAYASKKAYTSPFS 

EGNLYQASVKFLHHLASKYQTPAQDCSQTHEKMDSFDKASRLLESSYDFSELALDVDEKDRENLQIWSCLTQKEELEL 

VARSIRQKLHENSDL5YKHFRILLGDVASYQLSLKTIFDQYQIPFYLGRSEAMAHHPLTQFVESILALKRYRFRQEDLINL 

LRTDLYTDLSQSDIDAFEQYIRYLGINGLPAFQQTFTKSHHGKFNLERLNVLRLRILAPLETLFASRKQKAEKLLQKWSV 

FLKEGAVTKQLQDLTTTLEAVEQERQAEVWKAFCHVLEQFATVFAGSQVSLEDFLALLHSGMSLSQYRTIPATVDTVL 

VQSYDLIAPLTADFVYAIGLTQDNLPKISQNTSLLTDEERQNLNQATEEGVQLLIASSENLKKNRYTMLSLVNSARKQLF 

l^APSLFNESESKESAYLQELIHFGFRRREKRMNHKGLSKEDMGSYHSLLSSLVAYHQQGEMSDTEQDLTFVKVLSRVt 

GKKLDQQGLENPAIPTSPSSKTLAKDTLQALYPAKQEFYLSTSGLTEFYRNEYSYFLRYVLGLQEELRLHPDARSHGNFL 

HRIFERALQLPNEDSFDQRLEQAIQETSQEREFEAIYQESLEAQFTKEVLLDVARTTGHILRHNPAIETIKEEANFGGKDQ 

AFIQLDNGRSVFVRGKVDR1DRLKANGAIGVVDYKSSLTQFQFPHFFNGLNSQLPTYLAALKREGEQNFFGAMYLEMA 

EPVQSLMAVKSLAGAVVEASKSMKYQGLFLEKESSYLGEFYNKNKANQLTDEEFQLLLDYNAYLYKKAAEKILAGRF 

AINPYTENGRSLAPYVQQHQAITGFEANYHLGQARFLEKLDLADGKRLVGEKLKQAWLEKIREELtsrRZ 



MKLIPFLSEEEIQKLQEAEANSSKEQKKTAEQIEAIYTSAQNILVSASAGSGKTFVMAERILDQLARGVEISQLFISTFTVK 

AATELKERLEKKISKKIQETDDVDLKQHLGRQLADLPNAAIGTMDSFTQKFLGKHGYLLDIAPNFRILQNQSEQLILENE 

VFHEVFEAHYQGKQKETFSHLLKNFAGRGKDERGLRQQVYKIYDFLQSTSNPQKWLSESFLKGFEKADFTSEKEKLTE 

QUCQALWDLESFFRYHLDNDAKEFAKAAYLENVQULDEIGSLNQESDSQAYQAVLARVVAISKEKNGRALTNASRKA 

DLKPLADAYNEERKTQFAKLGQLSDQIAILDYQERYHGDTWKLAKTFQSFMSDFVEAYRQRKRQENAFEFADISHYTIE 

ILENFPQVRESYQERFHEVMVDEYQDTNHIQERMLELLSNGHNRFMVGDIKQSIYRFRQADPQIFNEFCFQRYAQNPQEG 

RLULKENFRSSSEVLSATNDVFERLMDQEVGEINYDNKHQLVFANTKLTPNPDNKAAFLLYDKDDTGEEEESQTETKL 

TGEMRLVIKEILKLHQEKGVAFKEIALLTSSRSRNDQILLALSEYGIPVKTDGEQNNYLQSLEVQVMLDTLRVIHNPLQD 

YALVALMKSPMFGFDEDELARLSLQKAEDKVHENLYEKLVNAQICMASSQKGLIHTALAEKLKQFMDILASWRLYAKT 

HSLYDLIWKIYNDRFYYDYVGALPNGPARQANLYALALRADQFEKSNFKGLSRFIRMIDQVLEAQHDLASVAVAPPKD 

AVELMTIHKSKGLEFPYVFILNMDQDFNKQDSMSEVILSRQNGLGVKYIAKMETGAVEDHYPKTIKLSIPSLTYRQNEEE 

LQLASYSEQMRLLYVAMTRAEKKLYLVGKGSREKLESKEYPAAKNGKLNSiNfTRJLQARNFQOWLNVAISKVFTKDKLNF 

SYRFIGEDQLTREAIGELETKSPLQDSSQADNRQSDTIKEALEMLKEVEVYNTLHRAAiELPSVQTPSQOCKFYEPVMDM 

EGVEIAGQGQSVGKKISFDLPDFSTKEKVTGAEIGSATHELMQRJDLSQQLTLASLTETLKQVQTSQAVRDKINLDKILAF 

FDTVLGQEILANTDHLYREQPFSMLKRDQKSQEDFVVRGILDGYLLYENKIVLFDYKTDRYDEPSQLVDRYRGQLALY 

EEALSRAYSIENIEKYULLGfCDEVQWKVZ 

MELARHAESLGVDAiATIPPIYFRLPEYSVAKYWNDISSAAPNTDYVIYNlPQLAGVALTPSLYTEMLKNPRVIGVKNSS 
MPVQDIQTFVSLGGEDHIVFNGPDEQFLGGRLMGARAGIGGTYGAMPELFLKLNQLIADKDLETARELQYAINAHGKL 
TSAHGNMYGVIKEVLKINEGLNIGSVRSPLTPVTEEDRPVVEAAAALIRETKERFLZ 



MYKTKCLREKLVLFLKIFFPIHYQFANYSASFVDTAMTGQYNTMDLAGVSMATSIWNPFFTFLTGIVSALVPHGHHLG 
RGKKEEVASDFYQFIYLALGLSVVLLGMVLFLAPHLNHIGLEAAVAAVAVRYLWFLSIGIIPLLLFSVIR5LLDSLGLTKL 
SMYLMLLLLPLNSGFNYLLIYGAFGVPELGGAGAGLGTSLAYWVLLGISVLVLFKQEKLKALHLEKRIPLNMDKIiCEGV 
RLGLPIGGTVFAEVAIFSVVGLIMAKFSPLIIASHQSAMNFSSLMYAFPMSISSAMAIVVSYEVGAKRFDDAKTYIGLGRW
TALIFA-AFTLTFLYIFRGNVASLYGNDPKFIDLTVRFLTYSLFFQLADTFAAPLQGILRGYKDTVIPFYLGLLGYWGVAIP
VYAIZ 

WSTLAKIEALLFVAGEDGIRVRQLAELLSLPPTGIQQSLGKLAQKYHKDPDSSLALIETSGAYRLVTKPQFAEILKEYSKA 
PINQSLSRAALETLSIIAYKQPITRIEIDAIRGVNSSGALAKLQAFDLIKEDGFCKEVLGRPNLYVTTDYFLDYMGINHLEEL 
PVIDELEIQAQESQLFGERIEEDENQZ 

MDTMISRFFRHLFEALICSLKRNGWMTVAAVSSVMITLTLVAIFASVIFNTAKLATDIENNVRVVVYIRKDVEDNSQTIE 
KEGQTVTNNDYHKVYDSLKNMSTVKSVTFSSKEEQYEKLTEIMGDNWKIFEGDANPLYDAYIVEANTPNDVKTIAEDA 
KKIEGVSEVQDGGANTERLFKLASFIRVWGLGIAALLIFIAVFLISNTIRITIISRSREIQIMRLVGAKNSYIRGPFLLEGAFIG 
LLGAIAPSVLVFIVYQiyYQSVNKSLVGQNLSMISPDLFSPLMIALLFVIGVFIGSLGSGISMRRFLKIZ 

N1KKVRFIFLALLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEWVFDTHYQSWFYIKADANYAENEWLKQGDDYF 

YLKSGGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIICADGQHAEKEW 

LQIKGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLKSGGY 

P4AANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYD 

SHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYDSHSQAWYYFKSGGYMAKNETVDGYQLGSDGK 

WLGGKTTNENAAYYQVVPVTANVYDSDGEKLSYISQGSVVWLDKDRKSDDKRLAITISGLSGYMKTEDLQALDASKD 

FIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAEELDKVFSLLNI 

NNSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVDKGILGATKWI 

KENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ 

^KKVLQKYWAWAFVVIPLLLQAIFFYVPMFQGAFYSFTNWTGLTYNYKFVGLNNFKLLFMDPKFMNAIGFTAIIAIAM 
VVGEIALGIFtARVLNSKIKGQTFFRAWFFFPAVLSGLTVALIFKQVFNYGLPAiGNALHIEFFQTSLLGTKWGAlFAAVF 
VLLWQGVAMPIIIFLAGLQSIPTEITEAAR1DGATSKQVFWNIELPYLLPSVSMVFILALKGGLTAFDQVFAMTGGGPNN 
ATTSLGLLVYNYAFKNNQFGYANAIAVILFFLIVVISIIQLRVSKKFEIZ 

MMKQDERKALIGKYILLILGSVLILVPLLATLFSSFKPTKDIVDNFFGFPTNFTWDNFSRLLADGIGGYYWNSVVITVLSL 
tAVMIFIPMAAYSlARNMSKRKAFTIMYTLULGIFVPFQVlMlPlT^ 

SIPESLDEAAEIDGANQFTTYFRIIFPMMKPMHATTMIINALWFWNDFMLPLLVLNRDSKMWTLPLFQYNYAGQYFND 
YGPSFASYVVGIISmVYLFFQRHIISGMSNGAVKZ 

WKSILQKMGEHPMLLLFLSYSTVISILAQNWMGLVASVGMFLFTIFFLHYQS1LSHKFFRLILQFVLFGSVLSAAFASLEH 
FQ1VKKFNYAFLSPNMQVWHQNRAEVTFFNPNYYGIICCFCIMIAFYLFTTTKLNWLKVFCVIAGFVNLFGLNFTQNRT 
AFPAIIAGAIIYLFTTIKNWKAFWL^IGVFAIGL^FLFSSDLGVRMGTLDSSMEERISIWDAGMALFKQNPFWGEGPLTYM 
NSYPRIHAPYHEHAHSLYIDTIL^YGIVGTILLVLSSVAPVRLMMDMSQESGKRPIIGLYLSFLTVVAVHGIFDLALFWIQS 
GFIFLLVMCSIPLEHRMLVSDMTDZ

\ISKMDVQKIIAPMMICFVMMRGIL\LKDGMLAILPLTVVGSLFLIMGQLPFEGLNICSIASVFGANWTEPFMQVYSGTFAI 

MGLISCFSIAYSYAKNSGVEALPAGVLSVSAFFILLRSSYIPKQGEAIGDAISKVWFGGQGIIGAIUGLVVGSIYTFFKRKIV 

IKMPEQVPQALAKQFEAMIPAFVIFLSSMIVYILAKSLTNGGTFIEMIYSAIQVPLQGLTGSLYGA1GLAFFISFLWWFGVH 

GQSVVNGVVTALLLSNLDANKAMLASANLSLENGAHIVTQQFLDSFLILSGSGITFGLVVAMLFAAKSKQYQALGKVA 

AFPAIFNVNEPVVFGFPfVMNPVMFVPFILVPVLAAVIVYGAlATGFMQPFSGVTLPWSTPAlL5GFLVGGWQGVITQLV[ 

LAMSTLVYFPFFKVQDRLAYQNEIKQSZ 

MKKKDLVDQLVSEIETGKVRTLGIYGHGASGKSTFAQELYQALDSTTVNLLETDPYITSGRHLVVPKDAPNQKVTASLP 
VAHELESLQRDILACRRVWMSZ 

MKKRYLVLTALLALSLAACSQEKTKNEDGETKTEQTAKADGTVGSKSQGAAQKKAEWNKGDYYSIQGKYDEIIVAN 
KHYPLSKDYNPGENPTAKAELVKLIKAMQEAGFPISDHYSGFRSYETQTKLYQDYVNQDGKAAADRYSARPGYSEHQT 
GLAFDVIGTDGDLVTEEKAAQWLLDHAADYGFVVRYLKGKEKETGYMAEEWHLRYVGKEAKEIAASGLSLEEYYGF 
EGGDYVDZ 

MREPDFLNHFLKKGYFKKHAKAVLALSGGLDSMFLFKVLSTYQKELEIELILAHVNHKQRIESDWEEKELRKLAAEAE 

LPIYISNFSGEFSEARARNFRYDFFQEVMKKTGATALVTAHHADDQVETIFMRLIRGTRLRYLSGIKEKQVVGEIEIIRPFL 

HFQKKDFPSIFHFEDTSNQENHYFRNRIRNSYLPELEKENPRFRDAILGIGNEILDYDLAIAELSNNINVEDLQQLFSYSES 

TQRVLLQTYLNRFPDLNLTKAQFAEVQQILKSK5QYRHPIKNGYELIFCEYQQFQICKISPQADEKEDELVLHYQNQVAY 

QGYLFSFGLPLEGEL1QQIPVSRETSIHIRHRXTGDVLIKNGHRKKLRRLFIDLKIPMEKRNSALIIEQFGEIVSILGIATNNL 

SKKTKNDIMNTVLYIEKIDRZ 

MRKFLIILLLPSFLTISKVVSTEKEVVYTSKEIYYLSQSDFGIYFREKLSSPMVYGEVPVYANEDLVVESGKLTPCTSFQIT 
EWRLNKQGIPVFKLSNHQFIAADKRFLYDQSEVTPTIKKVWLESDFKLYNSPYDLKEVKSSLSAYSQVSIDKTMFVEGRE 
FLHIDQAGWVAKESTSEEDNRMSKVQEML^EKYQKDSFSIYVKQLTTGKEAGINQDEKMYAASVLICLSYLYYTQEKIN 
EGLYQLDTTVKYVSAVNDFPGSYKPEGSGSLPKKEDNKEYSLKDLITKVSKESDNVAHNLLGYYISNQSDATFKSKMSA 
rMGDDWDPKEKLISSKMAGKFMEAIYNQNGFVLESLTKTDFDSQRIAKGVSVKVAHKIGDADEFKHDTGVVYADSPFIL
SIFTKNSDYDTISKLA1CDVYEVLKZ 

MKKQNNGLIKNPFLWLLFIFFLVTGFQYFYSGNNSGGSQQCNYTELVQEITDGNVKELTYQPNGSVIEVSGVYKNPKTSK 

EETGIQFFTPSVTKVEKFTSTILPADTTVSELQKLATDHKAEVTVKHESSSGrSVINLLVSIVPFGILFFFLFSMMGNMGGG 

NGRNPMSFGRSICAKAJVNKEDIKVRFSDVAGAEEEKQELVEVVEFLKDPKRFTKLGAIUPAGVLLEGPPGTGICTLLAKA 

VAGEAGVPFFSISGSDFVEMFVGVGASRVR5LFEDAKKAAPAIIFIDEIDAVGRQRGVGLGGGNDEREQTLNQLLIEMDG 

FEGNEGIIVIAATNRSDVLDPALLRPGRFDRKVLVGRPDVKGREAILKVHAKNKPLAEDVDLKLVAQQTPGFVGADLEN 

VLNEAALVAARRNKSIIDASDIDEAEDRVIAGPSKKDKTVSQKERELVAYHEAGHTIVGLVLSNARVVHKVTIVPRGRA 

GGYMIALPKEDQMLLSKEDMKEQLAGLMGGRVAEEIIFNVQTTGASNDFEQATQMARAMVTEYGMSEKLGPVQYEG 

NHAMLGAQSPQKSISEQTAYEIDEEVRSLLNEARNKAAEIIQSNRETHKLIAEALLKYETLDSTQIKALYETGKMPEAVE 

EESHALSYDEVKSICMNDEKZ 

MKRSSLLVRMVISIFLVFLILLALVGTFYYQSSSSAIEATIEGNSQTTtSQTSHFIQSYlKKLETTSTGLTQQTDVLAYAENP 

SQDKVEGIRDLFLTILKSDKDLKTWLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQW 

VISVTQELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPYIDTGQG 

YrPGHKSYVSQEKlAGTDWTVLGVSSLEKLDQVRSQLLWTLLGASVTSLLVCLCLVWFSLKRWIAPLKDLRETMLElAS 

GAQNLRAKEVGAYELREVTRQFNAMLDQ[DQLMVAIRSQEETTRQYQLQALSSQ1NPHFLYNTLDT1IWMAEFHDSQR 

VVQVTKSLATYFKLALNQGKDLICLSDEINHVRQYLFIQKQRYGDKLEYEINE^fVAFDNLVLPKLVLQPLVENALYHGI 

KEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLICRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKV

EIYINRJETSZ 

MKRSSLLVRMVISIFLVFLILLALVGTFYYQSSSSAIEATIEGNSQTTISQTSHFIQSYIKKLETTSTGLTQQTDVLAYAENP

SQDKVEGIRJDLFLTILKSDKDLKTVVLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQW 

VISVTQELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPYIDTGQG 

YTPGHKSYVSQEKIAGTDWTVLGVSSLEKLDQVR5QLLWTLLGA^VTSLLVCLCLVWFSLKRWIAPLKDLRETMLELAS 

GAQNLRAKEVGAYELREVTRQFNAMLDQIDQLMVAIRSQEETTRQYQLQALSSQINPHFLYNTLDTIIWMAEFHDSQR 

VVQVTKSLATYFRLALNQGKDLICLSDEINHVRQYLFIQKQRYGDKLEYEINENVAFDNLVLPKLVLQPLVENALYHGI 

KEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLICRGGVGLQNVDQRtKLHFGANYHMKIDSRPQKGTKV 

ElYINRIETSZ 

MFFKLLREALKVKQVRSKILFTIFrVLVFRIGTSITVPGVNANSLNALSGLSFLNMLSLVSGNALKNFSIFALGVSPYITASI 
VVQLLQMDILPKFVEWGKQGEVGRRKLNQATRYlALVLAFVQSrGnTAGFNTLAGAQLIKTALTPQVFLTIGIILTAGSMI 
VTWLGEQITDICGYGNGVSMIIFAGIVSSIPEMIQGIYVDYFVNVPSSR1TSSIIFVIILIITVLLIIYFTTYVQQAEYKIP[QYTK 
VAQGAPSSSYLPLKVNPAGVIPVIFASSITAAPAAILQFLSATGHDWANVVRVAQEMLATTSPTGIAMYALLIILFTFFYTF 
VQINPEKAAETYKRVVPISMEFVLVKVQKNICLNFFVVLQLLVPSSLVZ 

MDIRQVTETIAMIEEQNFDlRTITMGISLLDCIDPDINRAAEKIYQKrTTKAANLVAVGDEIAAELGIPIVNKRVSVTPISLIG 

AATDATDYVVLAKALDlCAAKEIGVDFIGGFSALVQKGYQKGDEILINSIPRALAETDiCVCSSVNIGSTKSGINMTAVAD 

MGRIIKETANLSDMGVAKLVVFANAVEDNPFMAGAFHGVGEADVIINVGVSGPGVVKRALEKVRGQSFDVVAETVKK 

TAFKITRIGQLVGQMASERLGVEFGIVDLSLAPTPAVGDSVARVLEEMGLETVGTHGTTAALALLNDQVFGCGGVMAC 

NQVGGL^GAFIPVSEDEGMIAAVQNGSLNLEKLEAMTAICSVGLDMIAIPEDTPAETL^MIADEAAIGVINMKTTAVRII 

PKGKEGDMIEFGGLLGTAPVMKVNGASSVDFISRGGQIPAPIHSFKNZ 

MTQIIDGKALAAKLQGQLAEKTAKLKEETGLVPGLVVILVGDNPASQVYVRNKERSALAAGFR5EVVRVPETITQEELL 
DLIAKYNQDPAWHGILVQLPLPKHIDEEAVLLAIDPEKDVDGFHPLNMGRLWSGHPVMIPSTPAGIMEMFHEYG1DLEG 
KNAVVIGRSNIVGKPMAQLLLAKNATVTLTHSRTHNLSKVAAKADILVVAIGRAKFVTADFVKPGAVVIDVGMNRDEN 
GKLCGDVDYEAVAPLASHITPVPGGVGPMTITMLMEQTYQAALRTLDRKZ 

MSKFNRIHLVVLDSVGIGAAPDANNFVNAGVPDGASDTLGHISKTVGLNVPNMAKIGLGNIPRETPLKTVAAESNPTGY 

ATKLEEVSLGKDTMTGHWEIMGLNITEPFDTFWNGFPEEILTKIEEFSGRKVIREANKPYSGTAVIYDFGPRQMETGELH 

YTSADPVLQIAAHEDIIPLDELYRICEYARSITLERPALLGR1IARPYVGEPGNFTRTANRRDLAVSPFFPTVLDKLNEAGI 

DTYAVGKINDIFNGAGINHDMGHNKSNSHGIDTLLKTMGLAEFEKGFSFTNLVDFDALYGHRRNAHGYRDCLHEFDE 

RLPEIIAAMRENDLLLITADHGNDPTYAGTDHTREYIPLLAYSPAFKGNGLIPVGHFADISATVADNFGVETAMIGESFL 

DJCLVZ 

MFISISAGIVTFLLTLVEIPAFIQFYRKAQ1TGQQMHEDVKQHQAKAGTPTMGGLVFLITSVLVAFFFALFSSQFSNNVGM 
ILFtLVLYGLVGFLDDFLKVFRKINEGLNPKQKLALQLLGGVIFYLFYERGGDILSVFGYPVHLGFFYlFFALFWLVGFSN 
AVNLTDGVDGLASISVVISLSAYGVIAYVQGQMDILLVILAMIGGLLGFFIFNHKPAKVFMGDVGSLALGGMLAAISMA 
LHQEWTLLIIGIVYVFETTSVMMQVSYFKLTGGKR1FRMTPVHHHFELGGLSGKGNPWSEWKVDFFFWGVGLLASLLT 
LAILYLMZ 

LFKKNKDILNIALPAMGENFLQMLMGMVDSYLVAHLGLIAISGVSVAGNIITIYQAIFIALGAAISSVISKSIGQKDQSKLA 
YHVTEALKITLLLSFLLGFLSIFAGKEMIGLLGTERDVAESGGLYLSLVGGSIVLLGLMTSLGAURATHNPRLPLYVSFL 
SNALNILFSSLAIFVLDMGIAGVAWGTIVSRLVGLVILWSQLKLPYGKPTFGLDKELLTLALPAAGERLMMRAGDVVtlA 
LVVSFGTEAVAGNAIGEVLTQFNYMPAFGVATATVMLLARAVGEDDWKRVASL5KQTFWLSLFLMLPLSFSIYVLGVP
LTHLYTTDSLAVEASVLVTLFSLLGTPMTTGTVIYTAVWQGLGNARLPFYATSIGMWCIR1GTGYLMGIVLGWGLPGIW 
AGSLLDNGFRWLFLRYRYQRYMSLKGZ 

MQTQEKHSQAAVLGLQHLLAMYSGSILVPIMIATALGYSAEQLTYLISTDIFMCGVATFLQLQLNKYFG1GLPVVLGVA
FQSVAPLIMIGQSHGSGAMFGALIASG1YVVLVSGIFSKVANLFPSIVTGSVITTIGLTLIPVAIGNMGNNVPEPTGQSLLLA 
AITVUILLINIFTKGFIKSISILIGLVVGTAIAATMGLVDFSPVAVAPLVHVPTPLYFGMPTFEISSIVMMCIIATVSMVEST 
GVYLALSDrTKDPlDSTRLRNGYRAEGLAVLLGGIFNTFPYTGFSQNVGLVKLSGIKKRLPIYYAAGFLVLLGLLPKFGA 
LAQIIPSSVLGGAMLVMFGFVSIQGMQILARVDFANNEHNFLIAAVSIAAGVGLNNSNLFVSMPTAFQMFFSNGIVVASL 
LArVLNAVLNHKKKZ



MKDRIKEYLQDKGKVTVNDLAQALGKDSSKDFRELIKTULMERKHQIRFEEDGSLTLEIKKKHEITLKGIFHAHKNGFG 
FVSLEGEEDDLFVGKNDVNYAIDGDTVEVVIKKVADRNKGTAAEAKIIDILEHSLTTVVGQIVLDQEKPKYAGYIRSKN 

QKISQPlYVKKPALKLEGTEVLKVFlDKYPSKKHDFFVASVLDVVGHSTDVGIDVLEVLESMDrVSEFPEAVVKEAESVP
DAPSQKDMEGRLDLRDEITFTIDGADAKDLDDAVHIICALKNGNLEFGVHIADVSYrVTEGSALDKEALNRATSVYVTD 
RVVPMLPERLSNGICSLNPQVDRLTQSAIMEIDKHGRVVNYTITQTVIKTSFRMTYSDVNDILAGDEEKRKEYHKIVSSIE 
LMAKLHETLENMRVKRGALNFDTNEAICILVDKQGKPVDIVLRQRGIAERMIESFMLMANETVAEHFSKLDLPFIYRIHE 
EPKAEKVQKFIDYASSFGLRIYGTASEISQEALQDIMRAVEGEPYADVLSMMLLRSMQQARYSEHNHGHYGLAADYYT 

HFTSPIRRYPDLLVHRMIRDYGRSKEIAEHFEQVIPEIATQSSNRERRAIEAEREVEAMKKAEYMEEYVGEEYDAVVSSIV

KFGLFVELPNTVEGLIHITNLPEFYHFNERDLTLRGEKSGITFRVGQQIRiRVERADKMTGEIDFSFVPSEFDVIEKGLICQS 
SRSGRGRDSNRRSDKKEDKRKSGRSNDKRKHSQKDKKKKGKKPFYKEVAKKGAKHGKGRGKGRRTKZ 

MGTTGFTIIDLIILIVYLLAVLVAGIYFSKiCEMKGKEFFKGDGSVPWYVTSVSIFATMLSPISFLGLAGSSYAGSWILWFA 
QLGMVVAiPLTIRFILPIFARIDIDTAYDYLDKRFNSKALRIISALLFnYQLGRMSIIMYLPSAGLSVLTGIDlNILIILMGW
AIVYSYTGGLKSVLWTDFIQGVtLISGVVLALFVLlANIKGGFGAVAETLANGKFLAANEKLFDPNLLSNSIFLIVMGSGF 
TII^SYASSQDLVQRFTTTQNIKXLNKMLFTNGVLSLATATVFYLIGTGLYVFYQVQNADSAASNIPQDQIFMYFIAYQL 
PVGrTGLILAAIYAASQSTISTGLNSVATSWTLDIQDVISKNMSDNRRTKlAQFVSLAVGLFSIGVSIVMAHSDIKSAYEWF 
NSFMGLVLGLLGGVnLGFVSKKANKQGAYAALIVSTIVMVFIKYFLPPTAVSYWAYSLISISVSVVSGYIVSVLTGNKVS 
APKYTTIHDITEIKADSSWEVRHZ

MKFSKKYLAAGSAVIYSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 
TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQK 
QEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNM 
QPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHF
IPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEVVSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGD 
HFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIK 
VRKNIZ 

MKKRAIVAVrVLLLIGLDQLVKSYIVQQIPLGEVRSWIPNFVSLTYLQNRGAAFSILQDQQLLFAVITLVVVIGAIWYLHK
HMEDSFWMVLGLTLIIAGGLGNFIDRVSQGFVVDMFHLDFINFAIFNVADSYLTVGVIILLIAMLKEEINGNZ 

MNTNLASFrVGLIIDENDRFYFVQKDGQTYALAKEEGQHTVGDTVKGFAYTDMKQKLRLTTLEVTATQDQFGWGRVT 
EVRKDLGVFVDTGLPDKEIVVSLDILPELICELWPKXGDQLYIRLEVDKKDR1WGLLAYQEDFQRLARPAYNNMQNQN 
WPAIVYRLKLSGTFVYLPENNMLGFIHPSERYAEPRLGQVLDARVIGFREVDRTLNLSLKPRSFEMLENDAQMILTYLE
SNGGFMTLNDKSSPDDIKATFGISKGQFKKALGGLMKAGKIKQDQFGTELIZ 

MKDVSLFLLKKVFKSRLNWIVLALFVSVLGVTFYLNSQTANSHSLESRLESRIAANERAINENEEKLSQMSDTSSEEYQF 
AKNNLDVQKNLLTRKTEILTLLKEGRWKEAYYLQWQDEEKNYEFVSNDPTASPGLKMGVDRERKIYQALYPLNIKAH 
TLEFPTHGIDQIVWILEVIIPSLFVVAIIFMLTQLFAERYQNHLDTAHLYPVSKVTFAISSLGVGVGYVTVLFIGICGFSFLV
GSLISGFGQLDYPYPIYSLVNQEVTIGKIQDVLFPGLLLAFLAFIVIVEVVYLIAYFFKQKMPVLFLSLIGIVGLLFGIQTIQP 
LQRJL\HLIPFTYLRSVEILSGRLPKQIDNVDLNWSMGMVLLPCLIIFLLLGlLFIERWGSSQKiCEFFNRFZ 

MMKFILDrVSTPAILVALIAlLGLVLQKKKLPDIIKGGIKTFVGFLVVSGGAGIVQNSLNPFGTMFEHAFHLSGVVPNNEAI 
VAVALTTYGSATAMIMFAGMVFNILLARf^RJKYIFLTGHHTLYMACMIAVILSVAGFTSLPLILLGGLALGIIMSISPAF
VQKYMVQLTGNDKVALGHFSSLGYWL5GFTGSLIGDKSKSTEDIKFPKSLAFLRDSTVSITLSMAVIYIIVAIFAGSEYIEK 
EISSGTSGLVYALQLAGQFAAGVFVILAGVRLILGEIVPAFKGISERLVPNSKPALDCPIVYTYAPNAVLIGFISSFVGGLVS 
MVIMIASGTVVILPGVVPHFFCGATAGVIGNASGGVRGATIGAFLQGILISFLPVFLMPVLGGLGFQGSTFSDADFGLSGII
LGMLNQFGSQAGIVIGLVLILAVMFGVSFIKKPSATEEZ

MIKTFI^ALSV1LFS!P1ITYSFFPSSNLNIWI^TQPILAQ1YAFPLATATMAAIL5FLFFFLSFYICKNKQIRFYSGILLLLSLIL 
LLFGTDKTLSSASNKTKTLKLVTWNVANQIEAQHIERIFSHFDADMAIFPELATNIRGEQENQR1KLLFHQVGLSMANYD 
IFTSPPTNSGLAPVTVIVICKSYGFYTEAKTFHTTRFGTIVLHSRKQNIPDHALHTAPPLPGLMEIWKQDLNIIHNQLASKYP 
KAIIAGDFNATMRHGALAKISSHRDALNALPPFERGTWNSQSPKLFNATIDHILLPKNHYYVKDLDfVSFQNSDHRCIFT 
EITFZ
MNPIQRSWAYVSRKRLRSFILFLILLVLLAGISACLTLM

ENVSPELETVA1CLKDKEAVTGEQSVERDDLSAADNNLVSLTALEDSSKDVTFTSSAFNLKEGRHLQKGDSKKILIHEEL 

AKKNGL^LHDKIGLDAGQSESGKGQTVEFEIIGIFSGKKQEKFTGLSSDFSENQVFTDYESSQTLLGNSEAQVSAARFYVE 

NPKEMDGLMKQVENLALENQGYQVEKENKAFEQIKDSVATFQTFLTIFLYGMLIAGAGALILVLSLWLRERVYEVGIL 

LALGKGKSSIFLQFCLEVVLVSLGALLPAFVAGNAnTYLLQTLLASGDQASLQDTLAKASSLSTSILSFAESYVFLVLLS 

CLSVALCFLFLFRKSPKEILSSISZ 

MLHNAFAWTRKFFKSIVIFUILLMASL^LVGL^IKGATAK^ 

TENKAIESYVKRINAIGDLTGYDLIETPETKKNLTADRAKRPGSSLMITGVNDSSKEDKFVSGSYKLVEGEHLTNDDKDK 
ILLHKDLAAKHGWKVGDKVKLDSNIYDADNEKGAKETVEVTIKGLFDGH^SAVTYSQELYEOTAn*DIHTAAKLYGY 
TEDTAIYGDATFFVTADKNLDDVMKELNGISGINWKSYTLVKSSSNYPALEQSISGMYKMANLLFWGSLSFSVLLLALL 
LSLWINARRKEVGILl^IGLKQASILGQFITESIUAIPALVSAYFLANYTARAIGNTVLANVTSGVAKQASKAAQASNLGG 
GAEVDGFSKTLSSLDISIQTSDFUIFVLALVLVVLVMALASSNLLRKQPKELLLDGEZ 

MSQDKQMKAVSPLLQRVINISSIVGGVGSLIFCIWAYQAGILQSKETI^AFIQQAGIWGPPLFIFLQILQTVVPIIPGALTSV 
AGVFIYGHIIGTIYNYIGrVIGCAIIFYLVRLYGAAFVQSVVSKRTYDKYIDWLDKGNRFDRFFIFMMIWPISPADFLCMLA 
ALTKMSFKRYMTIIILTKPFTLVVYTYGLTYIIDFFWQMLZ

MRNMWVVIKETYLRHVESWSFFFMVISPFLFLGISVGIGHLQGSSM 

ASAKEAIKEEKLKGYLTIDQEDSVLKAVYHGETSLENGIKFEVTGTLNELQNQLNRSTASLSQEQEKRLAQTIQFTEK1DE 
AKE^KFIQTIAAGALGFFLYMILITYAGVTAQEVASEKGTKIMEVVFSSIRASHYFYARMMALFLVILTHIGIYVVGGL 
AAVLLFKDLPFLAQSGILDHLGDAISLNTLLFILISLFMYVVLAAFLGSMVSRPEDSGFCALSPLMILIMGGFFGVTALGAA 
GDNLLLKIGSYIPFISTFFMPFRTINDYAGGAEAWISLAITVIFAVVATGFIGRMYASLVLQTDDLGIWKTFKRALSYKZ 

MTETIKLMKAHTSVRRFKEQEIPQVDLNEILTAAQMASSWKNFQSYSVrVVRSQEKKDALYELVPQEAIRQSAVFLLFV 

GDLNRAEKGARLHTDTFQPQGVEGLLISSVDAALAGQNALLAAESLGYGGVIIGLVRYKSEEVAELFNLPDYTYSVFG 

MALGVPNQHHDMKPRLPLENVVFEEEYQEQSTEAIQAYDRVQADYAGARATTSWSQRLAEQFGQAEPSSTRKNLEQK 

KLLZMLKLIAIVGTNSKRSTNRQLLQYMQKHFTDKAEIELVEIKAIPVFNKPADKQVPAEILEIAAKIEEADGVUGTPEYD 

HSXPAVLMSALAWLSYGIYPLLNKPIMITGASYGTLGSSRAQLQLRQILNAPEIKANVLPDEFLLSHSLQAFNPSGDLVDL 

DVnCKLDAIFDDFRlFVKITEKLRNAQELLRKDAEDFDWENLZ 

MOTYQLNNGVEIPVLGFGTFKAKDGEEAYRAVLEALKAGYRHIDTAAIYQNEESVGQA[KDSGVPR£EMFVTTKLW 
QQTYEQTRQALEKSIEKLGLDYLDLYLIHWPNPKPLRENDAWKTRNAEVWRAMEDLYQEGKIRAIGVSNFLPHHLDAL 
LETATIVPAVNQVRLAPGVYQDQVVAYCREKGILLEAWGPFGQGELFDSKQVQEIAANHGKSVAQIALAWSLAEGFLP 
LPKSVTTSRIQANLDCFGIELSHEERETLKTIAVQSGAPRVDDVDFZ 

MRCKMLDPIAIQLGPLAIRWYALCIVTGULAVYLTMKEAPRKFCIIPDDILDFILVAFPLAILGARLYYVIFRFDYYSQNLG 
EIFAIWNGGLAIYGGLITGALVLYIFADRKLINTWDFLDIAAPSVMIAQSLGRWGNFFNQEAYGATVDNLDYLPGFIRDQ 
MYIEGSYRQPTFLYESLWNLLGFALILIFRRKWKSLRRGHITAFYLIWYGFGRMVIEGMRTDSLMFFGFRVSQWLSVVLI 
GLGIMrVIYQNRKKAPYYITEEENZ 

MGKLSSILLGTVSGAALALFLTSDKGFCQVCSQAQDFLDDLREDPEYAKEQVCEKLTEVKEQATDFVLKTKEQVESGEIT 
VDSILAQTKSYAFQATEASKNQLNNLKEQWQEKAEALDDSEEIVIDITEEZ 

MKTKUFWGSMLFLLSLSILLTIYLAWIFYPMEIQWLNLTNRVYLKPETIQYNFHILMNYLTNPFSQVLQMPDFRSSAAG 
LHHFAVVKNLFHLVQLVALVTLPSFYVFVNRIVKKDFLSLYRKSLLALVVLPVMIGLGGVLIGFDQFFTLFHQILFVGD 
DTWLFDPAKDPVIMILPETFFLHAFLLFFALYENFFGYLYLKSRRKZ 

MTYHFTEEYDIIVIGAGHAGVEASLAASRMGCKVLLATINIEMLAFMPCNPSIGGSAKGIVVREVDALGGEMAKTIDKT 

YIQMKMLNTGKGPAVRALRAQADKELYSKEMRKTVENQENLTLRQTMIDEILVEDGKVVGVRTATHQEYAAKAVIVT 

TGTALRGEIIIGDLKYSSGPNHSLASINLADNLKELGLEIGRFKTGTPPRVKASSINYDVTEIQPGDEVPNHFSYTSRDEDY 

VKDQVPCWLTYTNGTSHEIIQNNLHRAPMFTGVVKGVGPRYCPSIEDKIVRFADKERHQLFLEPEGRNTEEVYVQGLST 

SLPEDVQRDLVHSlKGLENAEMMRTGYAiEYDMVLPHQLRATLETKKISGLFTAGQTNGTSGYEEAAGQGClAGINAAL 

KIQGKPELILKR5DGYIGVMIDDLVTKGTIEPYRLLTSRAEYRLILRHDNADMRLTEMGREIGLVDDERWARFEIKKNQF 

DNEMKRLDSIKLKPVKETNAKVEEMGFKPLTDAVTAKEFLRRPEVSYQDVVAFIGPAAEDLDDKIIELIETEIKYEGYISK 

AMDQVAKMKRMEEKRIPANIDWDDIDSUTEARQKFKLINPETIGQASRISGVNPADISILMVYLEGKNRSISKTLQKSKZ 

MTKQVLLVDDEEHILKLLDYHLSKEGFSTQLVTNGRKALALAETEPFDFILLDIMLPQLDGMEVCKRLRAICGVKTPIM 
MVSAKSDEFDKVLALELGADDYLTKPFSPRELLARVKAVLRRTKGEQEGDDSDNtADDSWLFGTLKVYPERHEVYKA 
NKLLSLTPKEFESDKNPFFEVFKVSKVTAQZ 

MTTFKDGFLWGGAVAAHQLEGGWQEGGKGISVADVMTAGRHGVAREITLGVLEGKYYPNHEAIDFYHRYKEDIALF 

AEMGFKCFRTSIAWTR1FPKGDELEPNEEGLQFYDNLFDECLKNGIEPVITLSHFEMPYHLVTEYGGWKNRKLIDFFARF 

AEVVFKRYKDKVKYWMTFNEINNQANYQEDFAPFTNSGIVYEEGDNREAIMYQAAHYELVASARAVKIGHEINPDFQI 
GCMIAMCPIYPVTCNPKDILMAMKAMQKRYYFADVHVLGKYPEHIFKYWERKGISVDFTAQDKEDLLGGTVDYIGFS
YYMSFAIDSHRENNPYFDYLETEDLVKNNYVKASEWEWQIDPEGLRYALNWFTDHYHLPLFrVENGFGAIDQVAADG 
MVHDDYRIEYLGAHIREMKKAVVEDGVDLMGYTPWGCIDLVSAGTGEMRKRYGFIYVDKDDNGKGSYNRSPKKSFG 
WYKEVISSNGESVEZ 



MDQQNGLFGFLENHVMGPMGJCLAQFKVVRAITAAGMAAVPFTIVGSMFLVFSILPQAFSFWPIVADIFSASFDKFTSLY 
MVANYATMGSLSLYFVLSLAYELTKIYAEEEELNMNPLNGALLALMAFVMTVPQIIFDGGMMKTVTSLKEGAVIADG 
WAMGWVARFGTTGIFTAIIMArVTVLIYRMCVKHNW 
DtFKVUIPFGFVS^TNSWIGLMlIYLLTQLLWIVGIHGANlVFAFVSPLALANMAENAAGGHFAVAGEFSNMFVlAGGS
GATLGLCLYIAFASKSEQLKAIGRASVVPALFNINEPLIFGLPIIYNPALAIPFILAPMVTATIYYVANSLNFIKPIIAQVPWP 
TPVGIGAFLGTADLRAVLVALVCAFAAFLVYLPF1RVYDQKLVKEEQGIZ 

MKKFYVSPIFPILVGLIAFGVLSTFIIFVNNNLLTVLILFLFVGGYVFLFKKLRVHYTRSDVEQIQYVNHQAEESLTALLEQ 
MPVGVMKLNLSSGEVEWFNPYAELILTKEDGDFDLEAVQTIIKASVGNPSTYAKLGEICRYAVHMDASSGVLYFVDVSR
EQAITDELVTSRPVIGIVSVDNYDDLEDETSESDISQINSFVANFISEFSEKHMMFSRRVSMDRFYLFTDYTVLEGLMNDK 
FSVIDAFREESKQRQLPLTLSMGFSYGDGNHDEIGKVALLNLNLAEVRGGDQVVVKENDETKNPVYFGGGSAASIKRT 
RTRTRAMMTA(SDKIRSVDQVFVVGHKNLDMDALGSAVGMQLFASNVIENSYALYDEEQMSPDIERAVSFIEKEGVTK 
LI^VKDAMGMVTNRSLULVDHSKTALTLSKEFYDLFTQTtVIDHHRRDQDFPDNAVrTYIESGASSASELVTELIQFQNS 
KKNRLSRMQASVLMAGMMLDTKNFTSRVTSRTFDVASYLRTRGSDSIAIQEIAATDFEEYREVNELILQGRKLGSDVLI
AEAKDMKCYDTVVISKAADAMLAMSGIEASFVLAKOTQGFlSlSAl^RSKLNVQRiMEELGGGGHFNLAAAQrKDVTLS 
EAGEKLTEIVLNEMKEKEKEEZ 

MKEKNMWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGLSIVVLALFIMGARKTKLASFNFSFF 
RAKDLARLGL^YLVIVGSNILGSILLQL5NETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGiVPKKIFRGKENL
GFVVGTIVFALLHQPSNLPSLLIYGGMSTVLSWTAYKTQRLEMSILLHMIVNGIAFCLLALVVIMSRTLGISVZ 

MKEKNMWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGL5IVVLALFIMGARKTKLASFNFSFF 
RAKDLARLGI^YLVIVGSNILGSILLQL^NETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGIVPKKIFRGKENL 
GFVVGTrVFALLHQPSNLPSLLIYGGMSTVl^WTAYKTQRLEMSILLHMIVNGLAFCLLALVVIMSRTLGtSVZ

MDTQKiEAAVKMIIEAVGEDANREGLQETPARVARMYQEIFSGLGQTAEEHLSKSFEIIDDNMVVEKDIFFHTMCEHHF 
LPFYGRAHIAYIPDGRVAGLSKLARTVEVYSKKPQIQERLNIEVADALMDYLGAKGAFVVIEAEHMCMSMRGVRKPGT 
ATLTTVARGLFETDKDLRDQAYRLMGLZMKDLFLKRKQAFRKECLGYLRYVLNDHFVLFLLVLLGFLAYQYSQLLQH 
FPENHWPILLFVGITSVLLLLWGGTATYMEAPDKILFLLVGEEEIKLHLKRQTGISLVFWLFVQTLFLLLFAPLFLAMGY
GLPVFLLYVLLLGVGKYFHFCQICASKFFTETGLDWDYVISQESKRKQVLLRFFALFTQVKGISNSVKRRAYLDFILKAV 
QKVPGKIWQNLYLRSYLRNGDLFALSLRLLLLSLLAQVFIEQAWIATAVVVLFNYLLLFQLLALYHAFDYQYLTQLFPL 
DKGQKEKGLQEVVRGLTSFVLLVELVVGLITFQEKLALLALLGAGLVLLVLYLPYQVKRQMQDZ 

MRKSrVLAADNAYLIPLETTIKSVLYHNRDVDFYILNSDIAPEWFKLLGRKMEVVNSTIRSVHIDKELFESYKTGPHINYA
SYFRFFATEVVESDRVLYLDSDnVTGELATLFEIDLKGYSIGAVDDVYAYEGRKSGFNTGMLLMDVAKWKEHSIVNSL 
LEI^EQNQVVHLGDQSILNIYFEDNWLALDKTYNYMVGIDIYHLAQECERLDDNPPTIVHYASHDKPWNTYSISRLRE 
LWWVYRDLDWSEIAFQRSDLNYFERSNQSKKQVML\TTWSADIKHLEYLVQRLPDWHFHLAAPCDCSEELTSLSQYTN 
VTVYQNVLHSRIDWLLDDSlVYLDINTGGEVFNVVTRAQESGKKIFAFDrTRKSMDDGLYDGIFSVERPDDLVDRMKNI 

EIEZ

MTKIYSSIAVKKGLFTSFLLFIYVLGSR1ILPFVDLNTKDFLGGSTAYLAFSAJVLTGGNLR5LSIFSVGI^PWMSAMILWQ 
MFSFSKRLGLTSTSIEIQDRRKMYLTLLIAVIQSLAVSLRLPVQSSYSAILVVLMNTILLIAGTFFLVWLSDLNASMGIGGSI 
VtLLSSMVLNIPQDVLETFQTVHIPTGIIVLLALLTLVFSYLLALMYRARYLVPVNKIGLHNRFKRYSYLEIMLNPAGGMP
YMYVMSFI^VPAYLFILLGFIFPNHSGLAALSKEFMVGKPLWVYVYISVLFLFSIIFAFVTMNGEEIADRMKKSGEYIYGI
YPGADTSRFINRLVLRFSVIGGLFNVIMAGGPMLFVLFDEKLLRL^^ 

MSSLSDQELVAKTVEFRQRLSEGESLDDILVEAFAVVREAD^CR1LGMFPYDVQVMGAIVMHYGNVAEM^^^GEGKTLT 
ATMPVYLNAFSGEGVMVVTPNEYI^KRDAEEMGQVYRFLGLTIGVPFTEDPKKEMKAEEKKLIYASDIIYTTNSNLGFD 

YLNDNLASNEEGKFLRPFNYVHDEIDDILLDSAQTPLIIAGSPRVQSNYYAIIDTLVTTLVEGEDYIFICEEKEEVWLTTKG
AKSAENFLGIDNLYKEEHASFARHLVYAIRAHKLFTKDKDYHRGNEMVLVDKGTGRLMEMTKLQGGLHQAIEAKEHV 
KLSPETRAMASITYQSLFKMFNKISGMTGTGKVAEKEFIETYNMSVVRJPTNRPRQR1DYPDNLYITLPEKVYASLEYIKQ
YHAKGNPLLVFVGSVEMSQLYSSLLFREGIAHNVLMANNAAREAQIISESGQMGAVTVATSMAGRGTDIKLGICGVAEL 
GGLrvlGTERMESQRIDLQtRGRSGRQGDPGMSKFFVSLEDDV[KKFGPSWVHKKYKDYQVQDMTQPEVLKGRKYRKL 

VEKAQHASDSAGRSARRQTLEYAESMNlQRDIVYKERNRLrDGSRDLEDVVVDHERYTEEVAADHYASRELLFHFIVTN
ISFHVKEVPDYIDVTDKTAVRSFMKQVIDKEl^EKiCELLNQHDLYEQFLRLSLLKAlDDNWVEQVDYLQQLSMAlGGQS 
ASQKNPIVEYYQEAYAGFEAMKEQIHADMVRNLLMGLVEVTPKGEIVTHFPZ 

MIGTFAAALVAVLANFTVPIErTPNSAOTELAPPDGlGQVLSN^ 
QELLKTUDVn^SKIVEWIINLAPFGILGLVFKTlSDKGVGSLANYGILLVLLNnTTMLFVAPVVNPLIAFFFMRRiNPYPLVW
NCLRVSGVTAFFTRSSATNIPVNMKLCHDLGLNPDTYSVSIPLGSTINMAGVAlTINLLTLAAVNTLGrPVDFATAFVLSV
VAAISSCDASGIAGGSLLLIPVACSLFG1SNDIA1QIVGVGFVIGVIQDSCETALNSSTDVLFTAVAEYAATRKKZ 



MSISQRTTKLIl^TCLACLLAYFLNLSSAVSAGIIALI^ 

LAFYVPLAYKMGWEIGITPSTVLVSHLLVQESTSPDLLVNEFLLFAIGTGFALLVNLYMPSREEEIQHYHTLVEEKLKDI 

LQ^KYYl^RGIXSRNRAQLVAELDTLLKEALRLVYLDHSDHLFHQTDYHIHYFEMRQRQSRILRNMAQQINTCHLAAS 

ESLILAQLFSKIAGQLSQTNPASDLLDEIERYLEVFRNR5LPKTREEFETRATLLQLLREAKTFIQVKVDFYQKYRQZ 

MEIMSLAIAVFAVIIGLVIGYVSISAKMKSSQEAAELMLLNAEQEATNLRGQAEREADLLVNEAKRESKSLKJCEALLEAK 

EEARKYREEVDAEFKSERQELKQIESRLTERATSLDRKDDNLTSKEQTLEQKEQSISDRAKNLDAREEQLEEVERQKEAE 

LERIGALSQAEARDULAQTEENLTREIASRIREAEQEVKERSDKMAKDILVQAMQRIAGEYVAESTNSTVHLPDDTMKG 

RIIGREGRNIRTFESLTGVDVIIDDTPEVVTLSGFDPIRREIARMTMEMLLKDGRIHPARIEELVEKNRQEIDNKIREYGEA 

AAYEIGAPNLHPDLMKIMGRLQFRTSYGQNVLRHSIEVAKLAGIMASELGENAALARRAGFLHDIGKAIDHEVEGSHVE 

[GMELARKYKEPPVVVNTIASHHGDVEAESVIAVIVAAADALSAARPGARSE5LESYKRLHDLEEIANGFEGVQTSFAL 

QAGREIRJMVNPGKIKDDKVTILAHKVRKKIENNLDYPGNIKVTVIRELRAVDYAKZ 

MMLKPSIDTLLDKVPSKYSLVILEAKRAHELEAGAPATQGFKSEKSTLRALEEIESGNVTIHPDPEGKREAVRRR1EEEKR 
RKEEEEKKIKEQIAKEKEDGEKIZ 

MSAYQLPTVWQDEASNQGAFTGLNRPTAGARFEQNLPKGEQAFQLYSLGTPNGVKVTILLEELLEAGFKEAAYDLYKI 
AIMDGDQFGSDFVKLNPNSKIPALLDQSGTENVRVFESAHILLYLAEKFGAFLPSNPVEKVEVLNWLFWQAGAAPFLG 
GGFGHFFNYAPEKLEYPINRFTMEVKRQLDLLDKELAQKPYIAGNDYTIADIAIWSWYGQLVQGKLYQGSAKFLDASS 
YQNLVKWAEKIANRPAVKRGLEVTYTEIKZ 

LASLITSIIMFYVGFDVLRDTIQKILSREETVIDPLGATLGIIS/^ 

GTAlAILASSFNYPIVDKLVAiHTFFILKTAYDIFIESSFSLSDGFDDRLLEDYQKAIMEIPKISKVKSQRGRTYGSNIYLDrr 
LEMNPDLSVFESHEIADQVESMLEERFGVFDTDVHIEPAPIPEDEILDNVYKKLLMREQLIDQGNQLEELLTDDFVYIRQ 
DGEQMDKEAYKTFCKELNSAIKDIQrrSISQKTKLICYELDGIIHTSIWRRHETWQNIFHQETKKEZ 

MTKLVATDMDGTFLDGNGRFDMDRLKSLLVSYKEKGIYFAVASGRGFLSLEKLFAGVRDDIIFIAENGSLVEYQGQDL 
YEATMSRDFYLATFEKLICTSPYVDrNKLLLTGKKGSYVLDTVDETYLKVSQHYNENIQKVASLEDrrDDfFKFTTNFTEE 
TLEDGEAWVNENVPGVKAMTTGFESIDIVLDYVDKGVAIVELVKKLGITMDQVMAFGDNLNDLHMMQVVGHPVAPE 
NARPEILELAKTVIGHHKERSVIAYMEGLZ 

MADIKLlALDLDGTLLTTDKRLTDRTKETLQAARJ)RGIKVVLTTGRPLKAMDFFLHELGTDGQEDEYTrrFNGGLVQK 
NTGEILDKTVFSYDDVARLYEETEKLSLPLDAISEGTVYQIQSDQESLYAKFNPALTFVPVDFEDLSSQMTYNKCVTAFA 
QEPLDAAiQKISPELFDQYEIFKSREMLLEWSPKNVHKATGLAKLISHLGIDQSQVMACGDEANDLSMIEWAGLGVAM 
QNAVPEVKAAANVVTPMTNDEEAVAWAIEEYVLKENZ 

MESLULLLIANLAGLFLWQRQDRQEKHLSKSLEDQADHLSDQLDYRFDQARQASQLDQICDLEVVVSDRLQEVRIELH 

QGLTQVRQEMTDNLLQTRDKTDQRLQALQESNEQRLEQMRQTVEEKLEKTLQTRLQASFETVSKQLESVNRGLGEMQ 

TVARDVGALNKVLSGTKTRGILGELQLGQ1IEDIMTPAQYEREYATVENSSERVEYAIKLPGQGDQEYVYLPIDSKFPLA 

DYYRLEEAYETGDKDEIERCRKSLLASVKRFARDIRNKYIAPPRTTNFGVLFVPTEGLYSEIVRNPVFFDDLRREEQHVA 

GPSTLSALLNSLSVGFKTLNIQICSADHISKTLASVKTEFGKFGGILVKAQKHLQHASGNIDELLNRRTIAIERTLRHIELSE 

GEPALDLLHFQENEEEYEDZ 

MKISHMKKDELFEGFYL1KSADLRQTRAGKNYLAFTFQDDSGEIDGKLWDAQPHNIEAFTAGKVVHMKGRREVYNNT 
PQVNQITLRLPQAGEPNDPADFKVKSPVDVKEIRDYMSQMIFKIENPVWQRIVRNLYTKYDKEFYSYPAAKTNHHAFET 
GLAYHTATMVRLADAISEVYPQLNKSLLYAGIMLHDLAKVIELTGPDQTEYTVRGNLLGHIALIDSErTKTVMELGIDDT 
KEEVVLLRHVII^HHGLLEYGSPVRPRIMEAEIIHMIDNLDASMMMMSTALALVDKGEMTNKIFAMDNRSFYKPDLDZ 

MSEICAKKGFKMPSSYTVLLIIUIMAVLTWRPAGAFIEGIYETQPQNPQGIWDVLMAPIRAMLGTHPEEGSLIKETSAAID 

VAFFILMVGGFLGIVNKTGALDVGIASlVKKYKGREKxMLILVLMPLFALGGTTYGMGEETMAFYPLLVPVMMAVGFDS 

LTGVAIILLGSQIGCLASTLNPFATGLASATAGVGTGDGIVLRLIFWVTLTALSTWFVYRYADKIQKDPTKSLVYSTRKED 

LKHFNVEESSSVESTLSSKQKSVLFLFVLTFILMVLSFIPWTDLGVTIFDDFNTWLTGLPVIGNIVGSSTSALGTWYFPEG 

AMLFAFMGILIGVIYGLKEDOSSFMNGAADLLSVALIVAIARGrQVIMNDGMITDTILNWGKEGLSGLSSQVFIVVTYIF 

YLPMSFUPSSSGLASATMGIMAPLGEFVNVRPSL1ITAYQSASGVLNLIAPTSGIVMGALALGRINIGTWWKFMGKLVVA 

IIVVTIALLLLGTFLPFLZ 

MSNSFVKLLVSQLFANLADIFFRVTIIANIYIISKSVIATSLVPILIGISSFVASLLVPLVTKRLALNRVLSLSQFGKTILLAIL 

VGMFTVMQSVAPLVTYLFVVAISILDGFAAPVSYAIVPRYATDLGKANSALSMTGEAVQLIGWGLGGLLFATIGLLPTT 

CINLVLYIISSFLMLFLPNAEVEVLESETNLEILLKGWKLVARNPRLRLFVSANLLEIFSNTIWVSSIILVFVTELLNKTESY 

WGYSNTAYSIGIIISGLIAFRLSEKFLAAKWEPQLFTPNLKTIQNPCLSLDPGWFLFSPNGCFLLDKKEFPLYGISVEKNTK 

RKETHMNSLPNHHFQNKSFYQL5FDGGHLTQYGGLIFFQELFSQLKLKERISKYLVTNDQRRYCRYSDSDILVQFLFQLL 
TGYGTDYACKELSADAYFPKLLEGGQLASQI^LSRFl^RTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTY

GKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERFNQLLFRMDSGFATPKLYDLIE 

KTGQYYLIKLKKNTVLSRLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNLFYDVISLVTNMTS 

GTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTLIKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTDCRFRHLFL 

HVVGKCVRTGRKQLLKLSSLYAYSELFSALYSRJRKVNLNLPVPYEPPRRKASLMMH2 

MMEFFQQLPHLEPYGNPQYFVYVIAATLPIFIGLFFKKRFAWYEVLVSLFFIVTMLVGGKTNQLAALGIYLCWEILLLLF 
YKHYRKSKDGKWVFYLVSFLSLLPIIFVKVQPAINGTQSLLGFLGISYLTFRSVGIVIELRDGVIKDFTLWEFLRFLLFMPT 
FSSGPIDRFKRFNENYQAIPERDELMDMLDESVRYIMWGFLYKFILAHVLGETLLPPLKNLALQSGGFFNLYALAVMYT 
FGLELFFDFAGYSMFALAISNLMGIRSPINFNKPFI^RDLKEFWNRWHMSL£FWFR£ 

VTSSMAYIVNMUMGFWHGVTWYYIAYGLFHGLGLVINDAWVRKKKTLNKERKKAGKAALPENRWIQLLGMVVTFH 
VVMLSFLIFSGFLNNLWFKKZ 

MLKRLWMIFGPVLIAGLLVFLLIFFYPTEMHHNLGAEKRS^^ 

GAHPAVLAEKYNRSYRPYLLGQGGAASLNQYFGMQQMLPQLENKQVVYVISPQWFSKNGYDPAAFQQYFNGDQLTS 
FLKHQSGDQASQYAATRLLQQFPNVAMKDLVQKLASKEELSTADNEMIELLARFNERQASFFGQFSVRGYVNYDKHV 
AKYLKILPDQFSYQAIEDVVKADAEICOTSNNEMGMENYFYNEQIKKDLiCKLKDSQKSFTYUCSPEYNDLQLVLTQFSK 
SKVNPIFIIPPVNKKWMNYAGLREDMYQQTVQK1RYQLESQGFTNIADFSKDGGEPFFMKDTIHLGWLGWLAFDKAVD 
PFL5NPTPAPTYHLNERFFSKDWATYDGDVKEFQZ 

MEKNLKALKQTTDQEGPAIEPEKAEDTKTVQNGYFEDAAVKDRTLSDYAGNWQSVYPFLEDGTFDQVFDYKAKLTG 
KMTQAEYKAYYTKGYHTDVTKINrTDNTMEFVQGGQSKKYTYKYVGKJCILTYKKGNRGVRFLFEATDADAGQFKYV 
QFSDHNVAPVKAEHFHIFFGGTSQEALFEEMDNWPTYYPDNLSGQEIAQEMLAHZ 

MKDGHLLAHHIRLLNGRJFQKLLSQDPEALYRGEQGKILAVLWNSETGCATATDIALATGLANNTLTTMIKKLEEQKL 
VIVSPCGKDKRKXYLVLTELGKSQKEVGHRVSQKLDTIFYKGFSEEEIHQFEGFQER1LANLKEKGNEVZ 

MTNLIATFQDRFSDWLTALSQHLQL^LLTLLLAILLAIPLAVFLRYHEKLADWVLQIAGIFQTIPSLAI-LGLFIPLMGIGTL 

PALTALVlYAIFPlLQ^nGLKGIDPNLQEAGIAFGMTRWERLKKFEIPLAMPVIMSGIRTAAVLIIGTATLAALlGAGGL 

GSF1LLGIDRNNASLILIGALSSAVLA1AFNFLLKVMEICAKLRTIFSGFALVALLLGLSYSPALLVQKEKENLVIAGKIGPEP 

EILANMYKLLIEENTSMTATVKPNFGKTSFLYEALKKGDIDIYPEFTGTVTESLLQPSPKVSHEPEQVYQVARDGIAKQD 

HLAYLICPMSYQOTYAVAVPKKUQEYGLKTISDLiCKVEGQLlCAGFTLEFNDREDGNKGLQSMYGLNLNVATIEPALRY 

QAIQSGDIQITDAYSTDAELERYDLQVLEDDKQLFPPYQGAPLMKEALLKKHPELERVLNTLAGKrrESQMSQLNYQVG 

VEGKSAKQVAKEFLQEQGLLKKZ 

N4MHTYLQKKIENIKTTLGEMSGGYRJIMVAAMADLGFSGTMKAIWDDLFAHRSFAQWIYLLVLGSFP 
VDWIGMICSLTGIICVIFVSEGRASNYLFGLINSVIYLILALQKGFYGEVLTTLYFTVMQPIGLLVWIYQAQFKKEKQEFV 
ARKLDGKGWTKYLStSVLWWLAFGFlYQSIGANRPYRDSITDATNGVGQILMTAVYREQWIFWAATNVFSlYLWWGES 
LQIQGKYLIYLINSLVGWYQWSFCAAKQNTDLLNZ 

MRNMKAKYAVWVAFFLNLTYAIVEFIAGGVFGSSAVLADSVHDLGDAIAIGISAFLETISNREEDNQYTLGYKRFSLLG 
ALVTAVILVTGSVLVlLENVTKILHPQPVNDEGILWLGIIAmNLLASLVVGKGKTKNESILSLHFLEDTLGWVAVILMAl 
VLRJT-DWYILDPLLSLVISFFILSKALPRFWSTLKIFLDAVPEGLDIKQVKSGLERLDNVASLNQLNLWTMDALEKNAIV 
HVCLKEMEHMETCKESIRIFLKDCGFQNITIEIDADLETHQTHKRKVCDLERSYEHQHZ 

MIEYKNVALRYTEKDVLRDVNLQIEDGEFMVLVGPSGSGKTTMLKMINRLLEPTDGNIYMDGKRIKDYDERELRLSTG 
YVLQAIALFPNLTVAENtALIPEMKGWSKEEITKKTEELLAKVGLPVAEYGHRLPSELSGGEQQRVGIVRAMIGQPKIFL 
MDEPFSALDAISRKQLQVLTICELHKEFGMTTIFVTHDTDEALICLADRJAVLQDGEIRQVANPETILKAPATDFVADLFG 

GSVHDZ 

MSAVAISAMTKVMQETHGNPSSIHGHGRQAGKLLREARQELAQLLRTfCPQHIFFTSGGTEGNNTTlIGYCLRHQEQGKH 

IITTAIEHHAVLETIDYLVQHFGFEATIIQPENQEITAQQIQKALRDDTILVSTMFVNNETGNLLPIAEIGQILKQHPAAYH 

VDAVQAIGKIPIHSEELGIDFLTASAHKFHGPKGIGFLYASSMDFDSYLHGGDQEQKKRAGTENLPAIVGMVAALKEDL 

EKQEEHFQHVQNLETAFLAELEGIQYYLNRGKHHLPYVLNIGFPGQKNDLLLLRLDLAGISISTGSACTAGVVQSSHVLE 

AMYGANSERLKESLR1SLSPQNTVEDLQTLAKTLKEIIGGZ 



MLFKLSKEKIELGLSRLSPARRIFLSFALVILLGSLLLSLPFVQVESSRATYFDHLFTAVSAVCVTGLSTLPVAHTYNIWG 

QIICLLLIQIGGLGLMTFIGVFYIQSKQKL5LRSRATIQDSFSYGETRSLRKFVYSIFLTTFLVESLGAILLSFRLIPQLGWGR 

GLFSSIFLAISAFCNAGFDNLGSTSLFAFQTDLLVNLVIAGLIITGGLGFMVWFDLAGHVGRKKKGRLHFHTKLVLLLTI 

GLLLFGTATTLFLEWNNAGTIGNLPVADKVLVSFFQTVTMRTAGFSTIDYTQAHPVTLLIYILQMFLGGAPGGTAGGLK 

rTTFFVLLVFARSELLGLPHANVARRTlAPRTVQKSFSVFIIFLMSFLIGLILLGITAKGNPPFIHLVFETISALSTVGVTANL 

TPDLGKLALSVIMPLMFMGRIGPLTLFVSLADYHPEKKDMIHYMKADISIGZ 
MSDRTIGILGLGIFGSSVLAALAKQDMNHAfDDHAERlNQFEPVLARGVIGDITDEELLRSAGlDTCDTWVATGENLESS

VLAVMHCKSLGVPTVIAKVKSQTAKKVLE^CiGADSVISPEYEMGQSLAQTILFHNSVDVFQLDK^fVS^VEMKIPQSWAG 

QSLSKLDLRGKYNLN[LGFREQENSPLDVEFGPDDLLKADTYILAVINNQYLDTLVALNSZ 

MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLVEAS 

GRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSK 

GQYTMMHSLIYRTDLLRASQF2 



MKFNPNQRYTRWSIRRLSVGVASVVVASGFFVLVGQPSSVRADGLNPTPGQVLPEETSGTKEGDLSEKPGDTVLTQAKP 
EGVTGNTNSLPTPTERTEVSEETSPSSLDTLFEKDEEAQKNPELTDVLKETVDTADVDGTQASPAETTPEQVKGGVKEN 
TKDSIDVPAAYLEKAEGKGPFTAGVNQVIPYELFAGDGMLTRLLLKASDNAPWSDNGTAKNPALPPLEGLTKGKYFYE 
VDLNGOTVGKQGQALIDQLRANGTQTYKATVKVYGNKDGFCADLTNLVATKNVDININGLVAKETVQKAVADNVKDS 
IDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRJLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQV
ALDGNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKAVADNVKDSIDV 
PAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQLALD 
GNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATFCKVTININGLISKETVQKAVADNVKTVSMFQQP 

U

MKLKSYILVGYIISTLLTILVVFWAVQKMLIAKGEIYFLLGMTP/ASLVGAGISLFLLLPVFTSLGKLKEHAKRVAAKDFP 
SNLEVQGPVEFQQLGQTFNEMSHDLQVSFDSLEESEREKGLMlAQLSHDIKTPrTSIQATVEGILDGHKESEQAHYLATIG 
RQTERLNKLVEELNFLTLm-ARNQVETTSKDSIFLDKLLIECMSEFQFLIEQERRDVHLQVIPESARIEGDYAKLSRILVN 
LVDNAFKYSAPGTKLEVVAKLEKDQLSISVTDEGQGIAPEDLENIFKRLYRVETSRNMKTGGHGLGLAIARELAHQLGG
EITVSSQYGLGSTFTLVLNLSGSENKAZ 

MFGQTAQHGLTNSLKDFWIFLLNIGPQLAFFCQMLRCSRSVEQGTGNHRREFNMIQQIFSHFGMTHLGQIKLVYQESID 
DL VP AKNLPNRQD? ^ V **^ PN ^ ITIEID RQ IV HG LD LLKG RK DKEHD IKSM FRQLELAStQQ ICPINQ R VHHG IL AFGEIS 

MEHLATYFSTYGGAFFAALGIVLAVGLSGMGSAYGVGKAGQSAAALLKEQPEKFASALILQLLPGTQGLYGFVIGlLrW 
LQLTPELPLEKGVAYFFVALPIAIVGYFSAKHQGNVAVAGMQILAKRPKEFMKGAILAAMVETYAILAFVVSFILTLRV2

MLKSEKQSRYQMLNEELSFLLEGETNVLANLSNASALIKSRFPNTVFAGFYLFDGKELVLGPFQGGVSCIRIALGKGVC 
GEAAHFQETVrVGDVTTYLNYISCDSLAKSEIVVPMMKNGQLLGVLDLDSSEIEDYDAMDRDYLEQFVAILLEKTAWD 
FTMFEEKSZ



MSVLEIKDLHVEIEGKEILICGVNLTLCTGEIAA1MGPNGTGKSTLSAAIMGNPNYEVTKGEVLFDGVNILELEVDERAR 
MGLFLAMQYPSEIPGITNAEFLRAAMNAGKEDDEKISVREFITKLDEKMELLNMKEEMAERYLNEGFSGGEKKRNEIL 
QLLMLEPTFALLDEIDSGLDIDALKVVSICGVMAMRGEGFGAMIITHYQRLLNYITPDVVHVMMEGRVVLSGGPELAAR 
LEREGYAKLAEELGYDYKEELZ



MPYKRQRSFSMALSKLDSLYMAVVADHSICNPHHQGKLEDAEQISLNNPTCGDVINLSVKFDAEDRLEDIAFLNSGCTIS 
TASASMMTDAVLGKTKQEILELATIFSEMVQGQKDERQDQLGDAAFL5GVAKFPQRIKCATLAWNALKKTIENQEKQZ

MKIQDLLRKDVMLLDLQATEKTAVIDEMIFCNLTDHGYVTDFETFKEGILAREALTSTGLGDGIAMPHSKNAAVKEATV 
LFAKSNKGVDYESLDGQATDLFFMIAAPEGANDTHLAALAELSQYLMKDGFADKLRQATSADQVIELFDQASEKTEEL 

VQAPANDSGDFIVAVTACTTGIAHTYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAIIIAADKAVEM
DRFDGKPLINRPVADGIRKTEELINLAL^GDTEVYRAANGAKAATASNEKQSLGGALYiGiLMSGVSQMLPFVIGGGIMI 
ALAFLIDGALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFAGYVAYSIAEKPGLVAGFVAGAIAKEGFAFGKIP 
YAAGGEATSTLAGVSSGFLGALVGGFIAGALVLAIKKYVKVPRSLEGAKSILLLPLLGTILTGFVMLAVNIPMAAINTAM 
NDFLGGLGGGSAVLLGIVLGGMMAVDMGGPVNKAAYVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLAIFVATLLF 

KDKFTKEERNSGLTNIIMGL5FITEGAIPFGAADPARAIPSFILGSAVAGGLVGLTGIKLMAPHGGIFVIALTSNALLYLVS
VLVGAIVSGVVYGYLRKPQAZ 

MANKNTSTTRRRPSKAELERKEAIQRMLISLGIA!LLIFAAFKLGAAGITLYNLIRLLVGSLAYLAIFGLL[YLFFFKWIRK 
QEGLLSGFFTIFAGLLLIFEAYLVWKYGLDICSVLKGTMAQVVTDLTGFRTTSFAGGGLIGVALYIPTAFLFSNIGTYFIGS
IULVGSLLVSPWSVYDIAEFFSRGFAKWWEGHERRKEERFVKQEEKARQKAEKEARLEQEETEKALLDLPPVDMETGE
ILTEEAVQNLPPIPEEKWVEPEIILPQAELKFPEQEDDSDDEDVQVDFSAKEALEYKLPSLQLFAPDKPKDQSKEKKIVRE 
NIKILEATFASFGIKVTVERAEIGPSVTKYEVKPAVGVRVNRISNL^DDLALALAAKDVRIEAPIPGKSLrGIEVPNSDlATV 
SFRELWEQSQTKAENFLEIPLGKAVNGTARAFDLSKMPHLLVAGSTGSGKSVAVNGIIASILMKARPDQVKFMMVDPK 
MVELSVYNDIPHLLIPVVTNPRXASKALQKVVDEMENRYELFAKVGVRNIAGFNAKVEEFNSQSEYKQIPLPFIVVIVDE
LADLMMVASKEVEDAIIRLGQKARAAGIHMILATQRPSVDVISGLIKANVPSRVAFAVSSGTDSRTILDENGAEKLLGRG 
DMLFKPIDENHPVRLQGSFISDDDVER1VNFIKTQADAOYDESFDPGEVSENEGEFSDGDAGGDPLFEEAKSLVIETQKA 
SASMIQRRLSVGFNRATRLMEELEIAGVIGPAEGTKPRKVLQQZ 


MSYFKKYKFDKSQFKLGMRTFKTGIAVFLVLLIFGFFGWKGLQIGALTAVFSLRESFDESVHFGTSR1LGNSIGGLYALV 
FFLL^4TFFHEAFVm•LVVVPICTMLT^MTNVAMNNKAGVIGGVAAMLI^^LSlPSGETILYVFVRVLETFMGVFVAiIVN 
YDIDRIRLFLEKKEKZ 


MNKSEHRHQURALITKNKIHTQAELQALLAENDIQVTQATLSRDIKNMNLSKVREEDSAYYVLNNGSISiCWEKRLELY 
MEDALVWMRPVQHQVLLKTLPGLAQSFGS1IDTLSFPDAIATLCGNDVCLIICEDADTAQKCFEELKKFAPPFFFEEZ 



MKSIKLNALSYMGIRVLNIIFPILTGTYVARVLDRTDYGYFNSVDTILSFFLPFATYGVYNYGLRAISNVKDNKKDLNRT
FSSLFYLCLACTILTTAVYILAYPLFFTDNPIVKKVYLVMGIQLIAQ1FSIEWVNEALENYSFLFYKTAFIRJLMLVSIFLFVK 
NEHDIVVYTLVMSLSTLINYUSYFWIKRDIKLVKIHLSDFKPLFLPLTAMLVFANANMLFTFLDRLFLVKTGIDVNVSY 
YTIAQRIVTVLAGVVTGAIGVSVPRLSYYLGKGDKEAYVSLVNRGSRJFNFFHPLSFGLMVLGPNAILLYGSEKYIGGGIL 
TSLFAFRTIILALDTILGSQILFTNGYEKR1TVYTVFAGLLNLGLNSLLFFNHIVAPEYYLLTTMLSETSLLVFYIIFIHRKQL 

IHLGHIFSYTVRYSLFSL5FVAIYFLINFVYPVDMVINLPFLINTGLIVLLSAISYISLLVFTKDSIFYEFLNHVLALICNKFKK
SZ 



MELFMKITNYElYKLKICSGLTNQQILKVLEYGENVDQELLLGDfADISGCRNPAVFMERYFQIDDAHLSKEFQKFPSFSIL 
DDCYPWDLSEIYDAPVLLFYKGNLDLLKFPKVAVVGSRACSKQGAKSVEKVIQGLENELVIVSGLAKGIDTAAHMAAL
QNGGKTIAVIGTGLDVFYPKAMKRLQDYIGNDHLVLSEYGPGEQPLKFHFPARNRIIAGLCRGVIVAEAKMRSGSLITCE 
RAMEEGRDVFAIPGSILDGLSDGCHHLIQEGAKLVTSGQDVLAEFEFZ 



MKQLTVEDAKQIELEILDYIDTLCKKHNINYIINYGTLIGAVRHEGFIPWDDDIDLSMPREDYQRFINIFQKEKSKYKLLS
LETDKNYFNNFIKITDSTTKIIDTROTKTYESGIFIDIFPIDRFDDPKVIDTCYKLESFKLLSFSICHKNIVYKDSLLKDWIRT 
AFWLLLRPVSPRYFANKIEKEIQKYSRENGQYMAFIPSKFKEKEVFPSGTFDKTIDLPFENLSLPAPEKFDTILTQFYGDY 
MTLPPEEKRFYSHEFHAYKLEDZ 


^4IKINHLTITQ^fKDLRX>LVSDLTMTIQDGEKVAIIGEEG^^GKSTLLKILMGEAI^DFTIKGNIQSDYQSLAYIPQKVPEDL 
KKKTLHDYFFLDSIDLDYSILYRLAEELHFDSNRFASDQEIGNLSGGEALKIQLIHELAKPFEILFLDEPSNDLDLETVDW 
LKGQIQKTRQTVIFISHDEDFLSETADTIVHLRLVKHRKEAETLVEHLDYDSYSEQRKANFAKQSQQAANMQRAYDKT 
MEKHRRVKQNVETALRATKDSTAGRLLAKKMKTVLSQEKRYEKAAQSMTQKPLEEEQIQLFFSDIQPLPASKVLVQLE 
KENLSIDDRVLVQKLQLTVRGQEfaGIIGPNGVGKSTLLAXLQRLLNDKREISLGFMPQDYHKKLQLDLSPtAYLSKTGE
KEELQKIQSHLASLNFSYPEMQHQIRSLSGGQQGKLLLLDLVLRKPNFLLLDEPTRNFSPTSQPQIRKLFATYPGGLITVS 
HDRRFLKEVCSIIYRMTEHGLKLVNLEDLZ 



MKPKTFYNLLAEQNLPLSDQQKEQFERYFELLVEWNEKINLTAITDKEEVYLKHFYDSIAPILQGLIPNETIKLLDIGAGA

GFPSLPMKILYPELDVTIIDSLNKRINFLQLLAQELDLNGVHFYHGRAEDFAQDKNFRAQYDFVTARAVARMQVLSELT 
IPYLKVGGKLLALICASNAPEELLEAKNALNLLFSKVEDNLSYALPNRDPRYITVVEKKKETPNKYPRKAGMPNKRPLZ 



MSIKLJAVD1DGTLVNSQKEITPEVFSAIQDAKEAGVKVVIATGRPIAGVAKLLDDLQLRDEGDYVVTFNGALVQETATG
HEnSESLTYEDYLDMEFLSRKLGVHMHAiTKDGIYTANRNIGKYTVHESTLVSMPrFYRTPEEMAGKEIVKCMFIDEPEI 
LDAAIEKIPAEFYERYSINKSAPFYLELLKKNVDKGSAITHLAEKLGLTKDETMAIGDEENDRAMLEVVGNPVVMENGN 
PEIKKIAKYITKTNDESGVAHAIRTWVLZ 


MTWIILGVIALIVIFVIVSYNGLVKNRMQTKEAWSQIDVQLICRRNDLLPNLIETVKGYAKYEGSTLEKVAELRNQVAAA 
TSPAEAMKASDALTRQVSGIFAVAESYPDLKASANFVKLQEELTOTENKISYSRQLYNSVVSNYNVKLETFPSNIIAGMF 
GFKAADFLQTPEEEKSVPKVDFSGLGDZ 

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MLFDQIASNKRKTWILLLVFFLLLALVGYAVGYLFIRSGLGGLVIALIIGFIYALSMIFQSTEIVMSMNGAR£VDEQTAPD 

LYHVVEDMALVAQIPMPRVFIIDDPALNAFATGSNPQNAAVAATSGLLAIMNREELEAVMGHEVSHIRNYDIRISTIAV 

ALASArTMLSSMAGRMMWWGGAGRJUlSDDDRDGNGLEUMLVVSL^ 

RNPQGMINALDKLDNSKPMSRHVDDASSALYINDPKXGGGFQKLFYTHPPISERJERLKQMZ 

MKLNIQEIRKQSEGLNFEQTLDLVDDLRARNQEILDVKDJLAVGKVQYEDRMYFLDYQLSYTIVLASSRSMEPVELVES 
YPVTEVFMEGATNQLDQEVLDDDLVLPIENGELDLAESVSDNILLNIPIKVLTAEEEAGQGFISGNDWQIMTEEEYQAQ 
KAVKKEENSPFAGLQGLFDGDEZ 

MKRQLALVVFSGGQDSTTCLFWVMQHYETVEAVTFAYGQRHHLEIQITREIAKEQGIRHHILDMSLLGQITAQPDFATl 
HISYIPDKLCVESKSLKLYLFSYRNHGDFHENCINTIGKDLVNLLDPRYLEVWGKFTPRGGISIDPYYNYGKQGTKYEGL 
AEQRLFQHDLYPEKIDNRZ 



MTETVEDKVSHSITGLDILKGIVAAGAVISGTVATQTKVFTNESAVLEKTVEKTDALATNDTVVLGTISTSNSASSTSLSA 
SESASTSASESASTSASTSASTSASESASTSASTSISASSTVVGSQTAAATEATAK1CVEEDRKKPASDYVASVTNVNLQSYA 
KRRKRSVDSIEQLLASIKNAAVFSGNTIVNGAPAINASLNIAK5ETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYT 

VTYVNPKTNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQVLSYNTSTMTTQGSGYTWGNG
AQMNGFFAKKGYGLTSSWTVPITGTDTSFTFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQSKSLSVSASQSASASAS 
TSASASASTSASASASTSASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASE 
SASTSASASASTSASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTST
SASASTSASESASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTS 

ASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASA
STSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTSTSASASTSASESASTSASASASTSASA 
SASTSASASARQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQHQPVPRLQRQPVRQLQQVPVLQSQHQQVLQ 
PQHRQVPRLQQAHQHLNQRRQAPQLQQVPVRQPQRRQVRQPQQVLVHQLQHQRVHRLRRQPVHQSQQVPVRQLPHQ 
QVPRLQQAPVRRLQQVLAPQPQPQPVRQPQQVSQRLNRHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLNRHQRVRPL 

QQVLAPQPQRQQVHRLQHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLSQHQRVRQPQQAHQLLNLHQPVRQPQHRQ
APQLQQVPVRQPQRRQVRRLQQVPVRQPQQVPVRQPQRRQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQH 
QPVHQSQQVPVRQFRINKCLGFSKYZ 

MGVETWFYSSICWLAIGLGSVWKFPYMTAANGGGGFLLIFLISTIHGFPLLLAEFALGRSAGVSA1KTFGKLGKNNKYN
FIGWIGAFALFILLSFYSVIGGWILVYLGiEFGKLFQLGGTGDYAQLFTSIISNPAlALGAQAAFILLNIFIVSRGVQKGIERA 
SKVMMPLLFIVFVFIIGRSLSLPNAMEGVLYFLKPDFSKLTSTGLLYALGQSFFALSLGVTVMLTYASYLDKKTNLVQSG 
IStVAMNISISIMAGLAlFQARSPFNIQSEGGPSLLFIVLPQLFDKMPFGTIFYVLFLLLFLFATVTFSVVMLEINVDNITNQD 
NSKRAKWSVILGILTFVFGIPSALSYGVMADVHIFGKTFFDAMDFLVSNLLMPFGALYLSLFTGYIFKKALAMEELHLD 

ERAWKQGLFQVWLFLLRFFVSSFQSSSLWSSLPNLCNQKGLEZ

MLKKWQLKDVILLAFLSIFFGGVFVGSGYVYNILSLLLTPLGLQAFANEILFGLWCMAAPLAAIFVPRVGSATIGEVLAA 
LAEVLYGSQFGLGALLSGFVQGLGSEFGFIVTKNRYESWLSLTANSIGITLVSFVYEYIKLGYYAFSLPFVLSLLVVRFISV 
YFFCTILVRAIVKLYHQFATGGKAZ

MVKVATQTPIISLFLLILSLETSFIPSIALTLSVVAFCILFMLYYRRFKMLAWMIILAILPSFANYWAVQLHGDASQAVML 
GTRAFVTVCIGLVFVSSVSLKELLLYLAQKGLSRSWSYAL1VVFNSFPLIQQEIKSLKEACLLRGQELHFWSPLIYSKVLM 
TVFRWRHLYLRALSAHGYDEHAQLKNSYRTFYIPKKTKLIYLLFFLLLQTSLFLZ

MRKHQLQVHKLTIl^MMIALDVVLTPIFRIEGMAPMSSVVNILAGIMMGPVYALAMATVTAFIRMTTQGIPPLALTGAT 
FGALLAGLFYKYGRKFHYSALGEILGTGIIGSIVSYPVMVLFTGSAAKLSWFIYTPRFFGATLIGTAISFIAFRFLIKQEFFK 
KVQGYFFSERIDZ

MQEFTNPFPIGSSSLIHCITNEISCEMLANGILALGCKPVMADDSREVLDFTKQSQALFINLGHLSAEKEKAIRMAASYAN 
QSSLPMVVDAVGVTTSSIRKSLVKDLLDYRPTVLKGNMSEIRSLVGLKHHGVGVDASAKDQETEDLLQVLKDWCQTYP 
GMSFLVTGPKDLVVSKNQVAVLGNGCTELDWITGTGDLVGALTAVFLSQGKTGFEASCLAVSYLNLAAEKIVVQGMG
LEEFRYQVLNQLSLLRRDENWLDTIKGEVYEZ 

MNHKlAILSDVHGNATALEAVlADAKNQGASEYWLLGDlFLPGPGANDLVALLKDLPITASVRGlsfWDDRVLEALDGQ 
YGLEDPQEVQLLRMTQYLMERMDPATIVWLRSLPLLEKJCEIDGLRFSISHNLPDKNYGGDLLVENDTEKFDQLLDAET
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DVAVYGHVHKQLLRYGSQGQQnNPGSIGMPYFNWEALKNHRSQYAVIEVEDGELLNIQFRKVAYDYEAELELAKSKG 
LPFIEMYEELRRDDNYQGHNLELLASUEKHGYVEDVKNFFDFLZ 



WNVNQIVRlIPTLKANNRKLNETFYIETLGMKALLEESAFLSLGDQTGLEKLVLEEAPSMRTRKVEGRiOCLAfU.rVKVE 
NfPLEIEGILSKTDStHRLYKGQNGYAFEIFSPEDDLILlHAEDDIASLVEVGEKPEFQTDLASISLSKFEISMELHLPTDIESF 
LESSEIGASLDFIPAQGQDLTVDNTVTWDLSMLKFLVNELDIASLRQKFESTEYFIPKSEKFFLGKDRNNVELWFEEVZ



MKWTKIIKKIEEQ1EAGIYPGASFAYFKDNQWTEFYLGQSDPEHGLQTEAGLVYDLASVSKVVGVGTVCTFLWEIGQLD 
IDRLVIDFLPESDYPDITIRQLLTHATDLDPFIPNRDLLTAPELKEAMFHLNRJISQPAFLYSDVHFLLLGFILERIFNQDLD 
VILKDQVWKPWGMTETKFGPVELAVPTVRGVEAGIVHDPKARLLGRX4AGSAGLFSTIKDLQIFLEHYLADDFARDLNQ 
NFSPLDDKERSLAWNLEGDWLDHTGYTGTFIMWNRQKQEATIFLSNRTYEKDERAQWILDRNQVMNLIRKEEZ 



MMFCKTYNHILVWGVIFYSrCIVCFCFTPQEQSTVGVGTPGIQHLGRLVFLLTPFNSLWKLGEVSDrGQLCWIFLQNILNV 
FLFFPLIFQLLYLFPNLRKTKKVLLFSFLVSLGIECTQLILDFFFDFNRVFEIDDLWTNTLGGYLAWLLYKRLHKNKVRN 
Z 



MKIPLLTFARHKFVYVLLTLLFLALVYRDVLMTYFFFDIHAPDLAKFDGQAIICNDLLKSALDFRILQFNLGFYQSFIIPIII 
VLLGFQYIELKNKVLRLSIGREVSYQGLKRKLTLQVASIPCLIYLVTVLIIAIITYFFGTFSPLGWNSLFSDGSGLQRLLDGE 
IKSYLFFTCVLLIGIFINAIYFLQIVDYVGNVTR5AITYLMFLWLGSMLLYSALPYYMVPMTSLMQASYGDVSLMKLFTP 
YILYIVPYMVLEICYEDNVZ 



MFKVLQKVGKAFMLPIAILPAAGLLLGIGGAL^NPTTIATYPILDNSIFQSIFQVMSSAGEVVFSNLSLLLCVGLCIGLAKR 

DKGTAALAGVTGYLVMTATIKALVKLFMAEGSAIDTGVIGALVVGIVAVYLHNRYNNIQLPSALGFFGGSRFVPIVTSF 

SSiLIGFVFFVtWPPFQQLLVSTGGYISQAGPIGTFLYGFLMRLSGAVGLHHIIYPMFWYTELGGVETVAGQTVVGAQKIF 

FAQLADLAHSGLFTEGTRFFAGRFSTMMFGLPAACLAMYHSVPKNRRKKYAGLFFGVALTSFITGITEPIEFMFLFVSPV 

LYVVHAFLDGVSFFIADVLNISIGNTFSGGVIDFTLFGILQGNAKTNWVLQIPFGLIWSVLYYIIFRWFITQFNVLTPGRGE 

EVDSKEISESADSTSNTADYLKQDSLQIIRALGGSNNIEDVDACVTRLRVAVKEVNQVDKALLKQIGAVDVLEVKGGIQ 

AIYGAKAILYKNSINEILGVDDZ 

MKFRKLACTVLAGAAVLGLAACGNSGGSKDAAKSGGDGAKTEITWWAFPVFTQEKTGDGVGTYEKSIIEAFEKANPDI 
KVKLETIDFKSGPEKITTAiEAGTAPDVLFDAPGRIIQYGKNGKLAELNDLFTDEFVKDVNNENIVQASKAGDKAYMYPI 
SSAPFYMAMNKKMLEDAGVANLVKEGWTTDDFEKVLKALKDKGYTPGSLFSSGQGGDQGTRAFISNLYSGSVTDEICV 
SKYTTDDPKFVKGLEKATSWIKDNLINNGSQFDGGADIQNFANGQTSYTILWAPAQNGIQAKLLEASKVEVVEVPFPSD 
EGKPALEYLVNGFAVFNNKDDK3CVAASKKFIQFIADDKEWGPKDVVRTGAFPVRTSFGKLYEDKRMETISGWTQYYSP 
YYNTIDGFAEMRTLWFPMLQSVSNGDEKPADALKAFTEKANETIKKAMKQZ 



MQSTEKKPLTAFTVISTHLLLLTVLFIFPFYWILTGAFKSQPDTIVIPPQWFPKMPTMENFQQLMVQNPALQWMWNSVFf 
SLVTMFLVCATSSLAGYVLAKKRFYGQRJLFAIFIAAMALPKQVVLVPLVRIVNFMGIHDTLWAVILPLIGWPFGVFLM 
KQFSENIPTELLESAKIDGCGEIRTFWSVAFPIVKPGFAALAIFTFINTWNDYFMQLVMLTSRNNLTISLGVATMQAEMA 
TNYGLIMAGAALAAVPIVTVFLVFQKSFTQGITMGAVKGZ 



MKIMFKNFNNlLLNRKIVLLLRIVLMMILINHLLSTAVQKQDAVrFFKRELISIFSYNDYSEANLEIPKLLLNLSLFMVGW 
LSVILLESDLADHYHHLIRYQSSSFFDYTRKRLVVISKFFTQDLFVWFLGLLPLGIHFKTVALFFLLAQLMMLYLLLSYLI 
AUSAGAGFSFFLYFLAFVGQEWMMDHIVTVYLVLLSLLVMLIVSRLEEKFKKGZ 



MGKGEMGKGVIGLEFDSEVLVNKAPTLQLANGKTATFLTQYDSKTLLFAVDKEDFGQEIIGIAKGSIESMHNLPVNLAG 
ARVPGGVNGSKAAVHEVPEFTGGVNGTEPAVHEIAEYKGSDSLVTLTTKKDYTYKAPLAQQALPETGNKESDLLASLG 
LTAFFLGLFTLGKXREQZ



MKKTFFLLVLGLFCLLPLSVFAIDFKINSYQGDLY1HADNTAEFRQKIVYQFEEDFKGQIVGLGRAGKMPSGFDIDPHPKI 

QAAKNGAELADVTSEVTEEADGYTVRVYNPGQEGDIVEVDLVWNLKNLLFLYDDIAELNWQPLTDSSESIEKFEFHVR 

GDKGAEKLFFHTGKLFREGTIEKSNLDYTIRLDNLPAKRGVELHAYWPRTDFASARDQGLKGNRLEEFNKIEDSIVREK 

DQSKQLVTWVLPSILSISLLLSVCFYFIYRRKTTPSVKYAKNHRLYEPPMELEPMVLSEAVYSTSLEEVSPLVKGAGKFTF 

DQLIQATLLDVIDRGNVSIISEGDAVGLRLVKEDGLSSFEKDCLNLAFSGKKEETLSNLFADYKVSDSLYRRAKVSDEKR 

IQARGLQLKSSFEEVLNQMQEGVRKRVSFWGLPDYYRPLTGGEKALQVGMGALTILPLFIGFGLFLYSLDVHGYLYLPL 
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PILGFLGLVLSVFYYWKLRLDNRDGVLNEAGAEVYYLWTSFENMLREIARLDQAELESIVVWNRLLVYATLFGYADK 
VSHLMKVHQ[QVENPD1NLYVAYGWHSTFYHSTAQMSHYASVA^^^ASTYSVSSGSGSSGGGFSGGGGGGSIGAFZ 



MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRTEAQTG 

EAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTPYQaKVDGS 

EKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYEQKDKSVPLDVVILLDNSNSMSNIRNKNA 

RRAERAGEATRSLIDKITSDSENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNTKDYSYLKL 

TNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVIFHrTDGVPTMSYPINFNHAT 

FAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQMFTDKTVYEKGAPAAFPVKPEKYSEMKAAGYA 

VIGDPINGGYrWLNWRESILAYPFNSNTAKITNHGDPTRWYYNGNIAPDGYDVFTVGIGINGDPGTDEATATSFMQSISS 

KPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITDPMGELIDLQLGTDGRFDPADYTLTANDGSRLENGQAVGGP 

QNDGGLLKNAKVLYDTTEKRIRVTGLYLGTDEKVTLTYNVRLNDEFVSNKFYDTNGRTTLHPKEVEQNTVRDFPIPKI 

RDVRKYPEITISKEICKLGDIEFIKVNKNDKKPLRGAVFSLQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDG 

KYRLFENSEPAGYKPVQNKP1VAFQIVNGEVRDVTSIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIG 

CMMMGGVLLYTRKHPZ 



MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKEIAG 

VMFVWTNTNNEIIDENGQTLGVNIDPQTFICLSGAMPATAMKKLTEAEGAKFNTANLPAAKYKIYEIHSLSTYVGEDGA 

TLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIVTKIPALANYA 

TANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLICLTDAGLAKVNDQNAEKTVKITYSATLNDKAI 

VEVPESNDVTFNYGNNPDHGNTPKPNKPNEMGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTGKVVQTVTLTTDKN 

TVTVNG LDKNTEYKFVERSIKGYSADYQEITTAGEIAVKNWKDENPKPLDPTEPKVVTYGKKFVKVNDKDNRLAGAEF 

VIANADNAGQYLARKADKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDKAQAAYNAAVIAANNAF 

EWVADKDNENWKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSYSATGQGIEYTAGSGKDDAT 

KVVNKKITIPQTGGIGTIIFAVAGAALMGIAVYAYVKNNKDEDQLAZ 



MTMQKMQKMISR1FFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 

QrVRDLHSWDENKI^SFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAKKTDTM 

TTKVKLIKVDQDHNRLEGVGFKLVSVARDVSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGNYRFKEVEPL 

AGYAVTTLDTDVQLVDHQLVTITVVNQICLPRGNVDFMFCVDGRTNTSLQGAMFKVMICEESGHYTPVLQNGKEVVVTS 

GKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPR1DVPDTGEETLVYLDACCHFV 

vwz 



MSHIYLSIFTSLLLMLGLVNVAQADEYLR1GMEAAYAPFNWTQDDDSNGAVKIDGTNQYANGYDVQIAKKIAKDLGKE 
PLVVKTKWEGLVPALTSGKIDMIIAGMSPTAERKQEIAFSSSYYTSEPVLLVKICDSAYASAKSLDDFNGAKITSQQGVYL 
YNLIAQIPGAKKETAMGDFAQMRQALEAGVIDAYVSERPEALTAEAANSKFKMIQVEPGFKTGEEDTAIAIGLRKNDNR 
ISQINASIETISKDDQVALMDRMIKEQPAEATTTEETSSSFFSQVAKILSENWQQLLRGAGrTLLISrVGTIIGLIIGLAIGVFR 
TAPLSE^^KVIYGLQKLVGWVL^^V'YIEIFRGTPMIVQSMVIYYGTAQAFGINLDRTLAAIRVSINTGAYMTEIVRGGILAV 
DKGQFEAATALGMTHNQTMRKIVLPQVVRNILPATGNEFVlNIKDTSVLNVISVVELYFSGha'VATQTYQYFQTFTILAV 
IYFVLTFTVTRILRJFIERRMDMDTYTTGANQMQTEDLKZ 



MTQAILEIKHLKKSYGQNEVLKDISLTVHKGEVISIIGSSGSGKSTFLRSINLLETPTDGQILYHGQNVLEKGYDLTQYREK 
LGMVFQSFNLFENLNVLEhn*lVAQTTVLKRERTEAEKIAKENLEKVGMGERYAVQAKPKQL^GGQKQRVALARAI^MN 
PDAILFDEPTSALDPEMVGEVLKIMQDLAQEGLTMIVVTHEMEFARDVSHRVIFMDKGVIAEEGKPEDLFTNPKEDRTK 
EFLQRYLKZ



MICKYQLLFKISAVFSYLFFVFSLSQLTLIVQNYWQFSSQIGNLFWIQNILSLLFIGVMIVVLVKTGHGYLFRIPRKKWLW 

YSILTVLVLVFQISFNVQTAKHVQSTAEGWAVL1GYSGTNFAELGIYIALFFLVPLMEELIYRGLLQHAFFKHSRFGLDLL 

LPSILFALPHFSSLPSLLDIFVFATVG1IFAGLTRYTKSIYPSYAVHVINNIVATFPFLLTFLHRVLGZ 



MNKKQWLGLGLVAVAAVGLAACGNR5SRNAASSSDVKTKAAIVTDTGGVDDKSFNQSAWEGLQAWGKEHNLSKDN 
GFTYFQSTSEADYANNLQQAAGSYNLIFGVGFALNNAVKDAAKEHTDLNYVLIDDVIKDQKNVASVTFADNESGYLA 
GVAAAKTTKTKQVGFVGGIESEVISRFEAGFKAGVASVDPSIKVQVDYAGSFGDAAKGKTIAAAQYAAGADIVYQVAG 
GTGAGVFAEAKSLNESRPENEKVWVIGVDRDQEAEGKYTSKDGKESNFVLVSTLKQVGTTVKDISNKAERGEFPGGQV 
IVYSLKDKGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEKZ 
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MSKKLQQISVPLISVFLGILLGAIVMWIFGYDAIWGYEELFYTAFGSLRGIGEIFRAMGPLVLIGLGFAVASRAGFFNVGL 
PGQALAGWILSGWFALSHPDMPRPLMILATIVIALIAGGIVGAIPGILRAYLGTSEVIVTIMMNYIVLYVGNAFIHAFPKD 
FMQSTDSTIRVGANATYQTPWLAELTGNSRMNIGIFFA1IAVAVIWFMLKKTTLGFEIRAVGLNPHASEYAGISAKRTIIL 
SMIISGALAGLGGAVEGLGTFQNVYVQGSSLAIGFNGMAVSLLAANSP1GILFAAFLFGVLQVGAPGMNAAQVPSELVSI 
VTASIIFFVSVHYLIERFVKPKKQVKGGKZ



MGVKKKLKLTSLLGLSLLIMTACATNGVTSDITAESADFWSKLVYFFAEIIRFLSFDISIGVGIILFTVLIRTVLLPVFQVQ 
MVASRKMQEAQPR1KALREQYPGRDMESRTKLEQEMRKVFKEMGVRQSDSLWPILIQMPVILALFQALSRVDFLKTGH 
FLWINLGSVDTTLVLPILAAVFTFL^TWLSNKAI^ERNGATTAMMYGIPVLIFIFAVYAPGGVALYWTVSNAYQVLQTY
FLNNPFKIIAEREAVVQAQKDLENRKRKAKKKAQKTKZ 



MVIDPFAINELDYYLVSHFHSDHIDPYTAAAILNNPKLEHVKFIGPYHCGRJWEGWGVPKERIIVVKPGDTIELKDMKIH 
AVESFDRTCLVTLPVNGADETGGELAGLAVTDEEMAQKAVNYIFETPGGTIYHGADSHFSNYFAKHGKDFKIDVALNN
YGENPVGIQDKMTSIDLLRMAENLRTKVIIPVHYDIWSNFMASTNEILELWKMRKDRLQYDFHPFIWEVGGKYTYPQD 
QHLVEYHHPRGFDDCFEQDSNIQFKALLZ



MFLSGWLSSFANTYIHDLLGVLFPDSPFLNAFESAIAAPLVEEPLKLLSLVFVLALIPVRKLKSLFLLGIASGLGFQMIKDI
GYIRTDLPEGFDFTISR1LERIISGIASHWTFSGLAVVGVYLLYRAYKGQKVGKKQGLIFLGLALGTHFLFNSPFVELETEL 
PLAIPVVTAIALYGFYHAYCFVEKHNELMTZ 



MKVEPRCDVLSRMSHFFFRILIMELQELVERSWAIRQAYHELEVKHHDSKWTVEEDLLALSNDIGNFQRLVMTKQGRY
YDETPYTLEQKLSENIWWLLELSQRLDIDILTEMENFLSDKEKQLNVRTWKZ 



MLDWKQFFLAYLRSRSRLFIYLLSLAFLVLLFQFLFASLGIYFLYFFFLCCFVTILFFTWDILVETQVYRQELLYGEREAK 
SPLEIALAEKLEAREMELYQQRSKAERKLTDLLDYYTLWVHQIKTPIAASQLLVAEVVDRQLKQQLEQEIFKIDSYTNLV
LQYLRLESFHDDLVLKQVQIEDLVKEIIRKYALFFIQKGLNVNLHDLDKEIVTDKICWLLVVIEQIISNSLKYTKEGGLEIY 
MDDQELCIKDTGIGIKNSDVLRVFERGFSGYNGRLTQQSSGLGLYLSKKISEELGHQJRJESEVGKGTTVRIQFAQVNLVL 
EZ 


MELNTHNAEILLSAANKSHYPQDELPEIALAGRSNVGKSSFINTMLNRKNLARTSGKPGKTQLLNFFNIDDKMRFVDVP 
GYGYARVSKKEREKWGCMIEEYLTTRENLRAVVSLVDLRHDPSADDVQMYEFLKYYEIPVIIVATKADKIPRGKWNKH 
ESAIKKKLNFDPSDDFILFSSVSKAGMDEAWDAILEKLZ 


MTKKQLHLVIVTGMSGAGKTVAIQSFEDLGYFTIDNMPPALLPKFLQLVEIKEDNPKLALWDMR5RSFFSEIQAVLDEL 
ENQDGLDFKILFLDAADKELVARYKETRRSHPLAADGRILDGIKLERELLAPLKNMSQNVVDTTELTPRELRKTLAEQF 
SDQEQAQSFRIEVMSFGFKYGIPIDADLVFDVRFLPNPYYLPELRNQTGVDEPVYDYVMNHPESEDFYQHLLALIEPILP 
SYQKEGKSVLT1AMGCTGGQHRSVAFAKRLAQDLSKNWSVNEGHRDKDRRKETVNRSZ 


MRKPKITVIGGGTGSPVILKSLREKDVEIAAIVTVADDGGSSGELRKNMQQLTPPGDLRNVLVAMSDMPKFYEKVFQYR 
FSEDAGAFAGHPLGNLIIAGLSEMQGSTYNAMQLLSKFFHTTGKIYPSSDHPLTLHAVFQDGTEVAGESHIVDHRGIIDN 
VYVTNALNDDTPLASRRVVQTILESDMIVLGPGSLFTSILPNIVIKEIGRALLETKAEIAYVCNIMTQRGETEHFTDSDHV 
EVLHRHLGRPFIDTVLVNIEKVPQEYMNSNRFDEYLVQVEHDFVGLCKQVSRVISSNFLRLENGGAFHDGDLIVDELMR

IQVKKZ




MKNLiKLLIlRLIVNLADSVFYIVALWHVSNNYSSSMFLGIFlAVNYLPDLLUFFGPVIDRVNPQKILIISILVQLAVAVIFL
LLLNQISFWVIMSLVFISVMASSISYVIEDVUPQVVEYDKIVFANSLFSISYKVLDSIFNSFASFLQVAVGFILLVKIDIGIFL 
LALFILLLLKFRTSNANIENFSFKYYKREVLQGTKFILNNKLLFKTSISLTLINFFYSFQTVVVPIFSIRYFDGPIFYGIFLTIA 
GLGGILGNMLAPIVIKYLKSNQIVGVFLFLNGSSWLVAIVIKDYTLSLILFFVCFMSKGVFNIIFNSLYQQIPPHQLLGRVN 
TTIDSIISFGMPIGSLVAGTLIDLNIELVLIAISIPYFLFSYIFYTDNGLFCEFSIYZ 



MMSNKNKEILIFAILYTVLFMFDGVKLLASLMPSAIANYLVYVVLALYGSFLFKDRI.IQQWKEIRKTKRKFFFGVLTGNV 
LFLILMTVVFEFVSEMLKQFVGLDGQGLNQSNIQSTFQEQPLLIAVFACVIGPLVEELFFRQVLLHYLQERLSGLLSIILV 
GLVFALTHMHSLALSEWIGAVGYLGGGLAFSIIYVKEKENIYYPLLVHMLSNSLSLIILAISIVKZ 






LIYrGLGLF^AY^NSvS^ 



QFLVWAVLALGFNTSYLRLYSALILAVCLMIPTFKQTILKGAKLSKZ 



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Table 3 

ID201 - 4106.4 

ATGATAAAAAATCCTAAATTATTAACCAAGTCTTTTTTAAGAAGTTTTGCAATTCTAGGTGGTGTTGGTCTAGTCAT

TCATATAGCTATTTATTTGACCTTTCCTTTTTATTATATTCAACTGGAGGGGGAAAAGTTTAATGAGAGCGCAAGAG
TGTTTACGGAGTATTTAAAGACTAAGACATCTGATGAAATTCCAAGCTTACTCCAGTCTTATTCAAAGTCCTTGACC 
ATATCTGCTCACCTTAAAAGAGATATTGTAGATAAGCGGCTCCCTCTTGTGCATGACTTGGATATTAAAGATGGAAA 
GCTATCAAATTATATCGTGATGTTAGATATGTCTGTTAGTACAGCAGATGGTAAACAGGTAACCGTGCAATTTGTTC
ACGGGGTGGATGTCTACAAAGAAGCAAAGAATATTTTGCTTTTGTATCTCCCATATACATTTTTGGTTACAATTGCT

TTTTCCTTTGTTTTTTCTTATTTTTATACTAAACGCTTGCTCAATCCTCTTTTTTACATTTCAGAAGTGACTAGTAA 
AATGCAAGATTTGGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGAAGTTGGTGAAGTTGGAAAACAGATTA 
ATGGTATGTATGAGCACTTGTTGAAGGTTATTTATGAGTTGGAAAGTCGTAATGAGCAAATTGTAAAATTGCAAAAT 
CAAAAGGTTTCCTTTGTCCGCGGAGCATCACATGAGTTGAAAACCCCTTTAGCCAGTCTTAGAATTATCCTAGAGAA 

TATGCAGCATAATATTGGAGATTACAAAGATCATCCAAAATATATTGCAAAGAGTATAAATAAGATTGACCAGATGA

GCCACTTATTAGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTGGACAGAGTGTCGTGAGACCTTGACTGTTAAG 
CCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCTCATTCAATAGGTGTTACAATTGAAAATCAATTGAC
AGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTTTGACAAACCTGATTAGTAATGCAATTAAAT 
ATTCAGATAAAAATGGGCGTGTAATCATATCCGAGCAAGATGGCTATCTCTCTATCAAAAATACATGTGCGCCTCTA

AGTGACCAAGAACTAGAACATTTATTTGATATATTCTATCATTCTCAAATCGTGACAGATAAGGATGAAAGTTCCGG
TTTGGGTCTTTACATTGTGAATAATATTTTAGAAAGCTATCAAATGGATTATAGTTTTCTCCCTTATGAACACGGTA 

TGGAATTTAAGATTAGCTTGTAG 

MIKNPKLLTKSFLRSFAILGGVGLVIHIAIYLTFPFYYIQLEGEKFNESARVFTEYLKTKTSDEIPSLLQSYSKSLT
ISAHLKRDIVDKRLPLVHDLDIKDGKLSNYIVMLDMSVSTM

FSFVFSYFYTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELESRNEQIVKLQN
QKVSFVRGASHELKTPLASLRIILENMQHNIGDYKDHPKYIAKSINKIDQMSHLLEEVLESSKFQEWTECRETLTVK
PVLVDILSRYQELAHSIGVTIENQLTDATRVVMSLRALDKVLTNLISNAIKYSDKNGRVIISEQDGYLSIKNTCAPL
SDQELEHLFDIFYHSQIVTDKDESSGLGLYIVNNILESYQMDYSFLPYEHGMEFKISLZ

ID202 - 4106.9 

ATGGATAAAATTATTAAAACTATATCAGAAAGCGGAGCCTTTCGTGCTTTTGTCCTTGATAGCACTGAAACCGTCCG 
CACTGCTCAAGAAAAACATCAAACCCAAGCTAGCTCAACTGTAGCGCTTGGTCGAACTCTTATCGCTAGCCAGATTC
TCGCAGCCAATGAAAAAGGAAATACCAAACTTACAGTTAAGGTGTTGGGATCTAGCTCTCTAGGTGCTATTATCACC 
GTCGCTGATACCAAGGGGAACGTCAAAGGCTATGTTCAAAATCCTGGTGTTGACATCAAAAAGACTGCGACTGGTGA 

AGTCCTAGTCGGACCTTTTGTTGGAAATGGTCAATTCCTCGTTATCA 

CTATAACTCCCCTCATCTCTGGAGAAATCGGTGAAGACCTTGCCTTTTACCTTACTGAAAGCCAACAAACGCCTTCA 
GCGGTCGGCCTCAATGTCCTTTTGGACGAGGAAGACAAGGTCAAGGTTGCAGGTGGTTTCCTAGTTCAAGTCTTGCC

AGGAGCCAAGAAAGAAGAGATTGCTCGCTTTGAAAAACGCATCCAAGAAATGCCAGCTATCTCTACTCTTCTCGAAA
GCGACGACCATATCGAAGCCCTCCTCAAGGCTATCTACGGGGACGAAGCCTACAAGCGTCTTTCTGAAGAAGAAATC 
CGTTTCCAATGTGACTGTAGCCATGAACGCTTTATGAACGCTCTTGCCAGCCTTCCAAGCTCAGACTTACAGGAAAT 
GAAAGAGGAAGACCACGGGGCAGAAATCACTTGTCAATTCTGCCAAACTACTTACAACTTTGATGAAAAGGACCTGG 

AGGAACTCATTCGTGACAAATCTTAA



MDKIIKTISESGAFRAFVLDSTETVRTAQEKHQTQASST^^ 

VADTKGNVKGYVQNPGVDIKKTATGEVLVGPFVGNGQFLVITDYGTGNPYNSITPLISGEIGEDLAFYLTESQQTPS 
AVGLNVLLDEEDKVKVAGGFLVQVLPGAKKEEIARFEKRIQEMPAISTLLESDDHIEALLKAIYGDEAYKRLSEEEI
RFQCDCSHERFMNALASLPSSDLQEMKEEDHGAEITCQFCQTTYNFDEKDLEELIRDKSZ



ID203 - 4115 

ATGAAATCAATAACTAAAAAGATTAAAGCAACTCTTGCAGGAGTAGCTGCCTTGTTTGCAGTATTTGCTCCATCATT 
TGTATCTGCTCAAGAATCATCAACTTACACTGTTAAAGAAGGTGATACACTTTCAGAAATCGCTGAAACTCACAACA 
CAACAGTTGAAAAATTGGCAGAAAACAACCACATTGATAACATTCATTTGATTTATGTTGATCAAGAGTTGGTTATC 
GATGGCCCTGTAGCGCCTGTTGCAACACCAGCGCCAGCTACTTATGCGGCACCAGCCGCTCAAGATGAAACTGTTTC 
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AGCTC CAGT AGCAG AAACT C CAGT AG T AAGTG AAAC AG TTG TTTC AACTGTAAGCGG ATCTGAAG CAG AAGCCAAAG 
AATGGATCGCTCAAAAAGAATCAGGTGGTAGTATACAGCTACAAATGGACGTTATATCGGACGTTACCAATTAA

MKSITKKIKATLAGVAALFAVFAPSFVSAQESSTYTVKEGDTLSEIAET^

DGPVAPVATPAPATYAAPAAQDETVSAPVAETPVVSETVVSTVSGSEAEAKEWIAQKESGGSIQLQMDVISDVTNZ
ID204 - 4117.1 

ATGAATTTAGGAGAATTTTGGTACAATAAAATAAATAAGAACAG

TTTTTTAGCTCTGCTATTTTTCTTAGCTAGTCCAGAGGGTGCAATGGCTAGTGATGGTACTTGGCAAGGAAAACAGT 

ATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGTTTTTGATACTCATTATCAATCTTGGTTCTATATAAAA

GCAGATGCTAACTATGCTGAAAATGAATGGCTAAAGCAAGGTGACGACTATTTTTACCTCAAATCTGGTGGCTATAT
GGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCAAGATGGAAAGATGAAAAGAAATGCTT 
GGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGATTCTCAATACGATGCT 
TGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTCCAAATTAAAGGGAAGGACTATTATTTCAA
ATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTATGTGAATGCTAGTGGTGCCAAAGTACAGCAAG 
GTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTACATCAAAGAAAATGGAAACTATGCTGATAAAGAATGGATT
TTCGAGAATGGTCACTATTATTATCTAAAATCCGGTGGCTACATGGCAGCCAATGAATGGATTTGGGATAAGGAATC 
TTGGTTTTATCTCAAATTTGATGGGAAAATGGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACT 
ACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATCTGAT 
GGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACAT 
GACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAAT 
GGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGAT 
GGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAATGAAAATGCTGCTTACTATCAAGTAGT 
GCCTGTTACAGCCAATGTTTATGATTCAGATGGTGAAAAGCTTTCCTATATATCGCAAGGTAGTGTCGTATGGCTAG 
ATAAGGATAGAAAAAGTGATGACAAGCGCTTGGCTATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGAT 
TTACAAGCGCTAGATGCTAGTAAGGACTTTATCCCTTATTATGAGAGTGATGGCCACCGTTTTTATCACTATGTGGC 
TCAGAATGCTAGTATCCCAGTAGCTTCTCATCTTTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCC
TGCATTTTGATGGTTTTAAGCTTGAGAATCCCTTCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAA 
GAATTGGATAAGGTATTTAGTTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAAGGAAGC 
CGAAGAACATTACCATATCAATGCTCTTTATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAA
TTGCCAAAGATAAGAATAATTTCTTTGGCATTACAGCCTATGATACGACCCCTTACCTTTCTGCTAAGACATTTGAT
GATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAGAACTTTCCTTGG 

AAACAAGGCTTCTGGTATGAATGTGGAATATGCTTCAGACCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGA 
AAATCAATGAGAAGCTAGGTGGCAAAGATTAG

MNLGEFWYNKINKNRGRRLMKKVRFIFLALLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEWVFDTHYQSWFYIK

ADANYAENEWLKQGDDYFYLKSGGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDA

WFYIKADGQHAEKEWLQIKGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWI

FENGHYYYLKSGGYMAANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSD

GKIAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYDSHSQAW 

GYQLGSDGKWLGGKTTNENAAYYQWPVTAW^ 

LQALDASKDFIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAE
ELDKVFSLLNINNSLLENKGATFKEAEEHYHINALYLLAHSAL 

DVDKGILGATKWIKENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ
ID205 - 4118.1

ATGAAAAAATTAGGTACATTACTCGTTCTCTTTCTTTCTGCAATCATTCTTGTAGCATGTGCTAGCGGAAAAAAAGA 
TACAACTTCTGGTCAAAAACTAAAAGTTGTTGCTACAAACTCAATCATCGCTGATATTACTAAAAATATTGCTGGTG 
ACAAAATTGACCTTCATAGTATCGTTCCGATTGGGCAAGACCCACACGAATACGAACCACTTCCTGAAGACGTTAAG 
AAAAGTTCTGAGGCTAATTTGATTTTCTATAACGGTATCAACCTTGAAACAGGTGGCAATGCTTGGTTTACAAAATT 
GGTAGAAAATGCCAAGAAAACTGAAAACAAAGACTACTTCGCAGTCAGCGACGGCGTTGATGTTATCTACCTTGAAG 
GTCAAAATGAAAAAGGAAAAGAAGACCCACACGCTTGGCTTAACCTTGAAAACGGTATTATTTTTGCTAAAAATATC 
GCCAAACAATTGAGCGCCAAAGACCCTAACAATAAAGAATTCTATGAAAAAAATCTCAAAGAATATACTGATAAGTT 
AGACAAACTTGATAAAGAAAGTAAGGATAAATTTAATAAGATCCCTGCTGAAAAGAAACT 

^^^T ACTTC T CT AAAGC CT ATGGTGTC C CAAGTG CTT AC ATCTGGG AAAT CAATACTGAAGAAG AAGGA 
ACTCCTGAACAAATCAAGACCTTC " — 



GGATGACCGTCCAATGAAAACTGTTTCTCAAGACACAAACATCCCAATCTACGCTCAAAT* 



CTTTACTGACTCTATCG 
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CAGAACAAGGTAAAGAAGGCGACAGCTACTACAGCATGATGAAATACAACCTTGACAAGATTGCTGAAGGATTGGCA 
AAATAA 

MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVPIGQDPHEYEPLPEDVK

KTSEANLIFYNGINLETGGNAWFTKLVENAKXTENKIDYFAVSDGVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKN 
AKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEG
TPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNLDKIAEGLA 
KZ 


ID206 - 4119.1

ATGGAATGGTATAAAAAAATCGGACTTCTTGCAACTACAGGTTTAGCTTTGTTTGGGCTCGGCGCTTGCTCCAACTA 
TGGTAAATCTGCGGATGGCACAGTGACCATCGAGTATTTCAACCAGAAAAAAGAAATGACCAAAACCTTGGAAGAAA 
TCACTCGTGATTTTGAGAAGGAAAACCCTAAGATCAAGGTCAAAGTCGTCAATGTACCAAATGCTGGTGAAGTATTG 

AAGACACGCGTTCTCGCAGGAGATGTGCCTGATGTGGTCAATATTTACCCACAGTCCATCGAACTGCAAGAATGGGC
AAAAGCAGGTGTTTTTGAAGATTTGAGCAACAAAGACTACCTGAAACGCGTGAAAAATGGCTACGCTGAAAAATATG 
CTGTAAACGAAAAAGTTTACAACGTTCCTTTTACAGCTAATGCTTATGGAATTTACTACAACAAAGATAAATTCGAA 
GAACTGGGCTTGAAGGTTCCTGAAACCTGGGATGAATTTGAACAGTTAGTCAAAGATATCGTTGCTAAAGGACAAAC 
ACCATTTGGAATTGCAGGTGCAGATGCTTGGACACTCAATGGTTACAATCAATTAGCCTTTGCGACAGCAACAGGTG

GAGGAAAAGAAGCAAATCAATACCTTCGTTATTCTCAACCAAATGCCATTAAATTGTCGGATCCGATTATGAAAGAT
GATATCAAGGTCATGGACATCCTTCGCATCAATGGATCTAAGCAAAAGAACTGGGAAGGTGCTGGCTATACCGATGT 
TATCGGAGCCTTCGCACGTGGGGATGTCCTCATGACACCAAATGGGTCTTGGGCGATCACAGCGATTAATGAACAAA 
AACCGAACTTTAAGATTGGGACCTTCATGATTCCAGGAAAAGAAAAAGGACAAAGCTTAACCGTTGGTGCGGGAGAC 
TTGGCATGGTCTATCTCAGCCACGACCAAACATCCAAAAGAAGCCAATGCCTTTGTGGAATATATGACCCGTCCAGA 

AGTCATGCAAAAATACTACGATGTGGACGGATCTCCAACAGCGATCGAAGGGGTCAAACAAGCAGGAGAAGATTCAC
CGCTTGCTGGTATGACCGAATATGCCTTTACGGATCGTCACTTGGTCTGGTTGCAACAATACTGGACCAGTGAAGCA 
GACTTCCATACCTTGACCATGAACTATGTCTTGACCGGTGATAAACAAGGCATGGTCAATGATTTGAATGCCTTCTT 
TAACCCGATGAAAGCGGATGTGGATTAG 


MEWYKKIGLI^TTGIALFGLGACSNYGKSADGTVTIEYFNQKKE 

KTRVLAGDVPDVVNIYPQSIELQEWAKAGVFEDLSNKDYLKRVKNGYAEKYAVNEKVYNVPFTANAYGIYYNKDKFE
ELGLKVPETWDEFEQLVKDIVAKGQTPFGIAGADAWTLNGYNQLAFATATGGGKEANQYLRYSQPNAIKLSDPIMKD
DIKVMDILRINGSKQKNWEGAGYTDVIGAFARGDVLMTPNGSWAITAINEQKPNFKIGTFMIPGKEKGQSLTVGAGD
LAWSISATTKHPKEANAFVEYMTRPEVMQKYYDVDGSPTAIEG^^
DFHTLTMNYVLTGDKQGMVNDLNAFFNPMKADVDZ

ID207 - 4123.1 

ATGAAGAAAATCAAACCGCATGGACCGTTACCAAGTCAGACTCAGCTAGCTTATCTGGGAGATGAACTAGCAGCTTT 

TATCCACTTCGGTCCTAATACCTTTTATGACCAAGAATGGGGGACTGGACAGGAGGATCCTGAGCGCTTTAACCCGA
GTCAGTTGGATGCGCGTGAGTGGGTTCGTGTGCTCAAGGAAACGGGCTTCAAAAAGTTGATTTTGGTGGTCAAGCAC
CACGATGGCTTTGTCCTTTATCCGACAGCTCACACAGATTATTCGGTTAAGGTCAGTCCTTGGAGGAGAGGAAAGGG 
CGACTTGCTCCTTGAAGTATCCCAAGCTGCCACAGAGTTTGATATGGATATGGGGGTCTACCTGTCACCGTGGGATG 
CCCATAGTCCCCTCTATCATGTGGACCGAGAAGCGGACTACAATGCCTATTATCTGGCTCAGTTGAAGGAAATCTTA 

TCAAATCCTAACTATGGGAATGCTGGTAAGTTCGCTGAGGTTTGGATGGATGGTGCCAGAGGAGAGGGCGCGCAAAA
GGTTAATTATGAATTTGAAAAATGGTTTGAAACCATTCGTGACCTGCAGGGCGATTGCTTGATTTTTTCAACAGAAG
GCACCAGTATCCGCTGGATTGGCAATGAACGAGGGTATGCAGGTGATCCACTGTGGCAAAAGGTGAATCCTGATAAA 
CTAGGAACAGAAGCAGAGCTGAACTATCTTCAGCACGGGGATCCCTCGGGCACGATTTTTTCAATCGGAGAGGCAGA 
TGTTTCCATCCGTCCAGGCTGGTTCTACCATGAGGATCAGGATCCTAAGTCTCTCGAGGAGTTGGTCGAAATCTACT 

TTCACTCAGTAGGGCGAGGAACTCGACTCTTGCTTAATATTCCGCCGAATCAAGCTGGGCTCTTTGATGCAAAGGAT
ATTGAACGACTTTATGAATTTGCGACCTATCGCAATGAGCTCTATAAAGAAGATTTGGCTCTGGGAGCTGAGGTATC
TGGTCCAGCTCTTTCCGCAGACTTTGCTTGTCGCCATTTGACAGACGGCCTTGAGACCAGCTCTTGGGCAAGCGATG 
CAGACTTGCCCATCCAGTTAGAACTCGACTTAGGTTCTCCTAAAACTTTTGATGTAATTGAGTTAAGAGAAGATTTG 
AAGCTAGGGCAACGAATCGCTGCTTTTCATGTGCAAGTAGAGGTGGATGGTGTCTGGCAGGAGTTTGGTTCGGGTCA 

TACTGTTGGTTACAAACGTCTCTTACGAGGAGCAGTTGTTGAGGCACAGAAGATACGTGTAGTCATTACAGAATCAC
AGGCTTTGCCTTTGTTGACCAAGATTTCCCTTTATAAAACTCCTGGATTATCAAAAAAAGAAGTTGTTCAGGAACTA
GCATTTGCAGAAAAAAGCCTAGCTGTGGCAAAGGGAGAAAATGCCTATTTTACAGTTAAGCGCAGAGAATGTAGTGG
TCCTTTAGAAGCTAAGATTTCGATTCAACCGGGGACAGGTGTCCATGGTGTCGCCTATCAGGATGAGATTCAAGTCC 
TTGCGTTTGAAACTGGTGAGACTGAAAAAAGTCTGACGCTACCAACCTTGTATTTCGCAGGAGATAAAACCTTGGAT 

TTCTATCTGAACCTAACGGTGGATGGTCAGCTTGTGGATCAACTTCAAGTCCAAGTTTCATAA
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MKKIKPHGPLPSQTQLAYLGDELAAFIHFGPNTFYDQEWGTGQEDPERFNPSQLDAREWVRVLKETGFKKLILWKH 
HDGFVLYPTAHTDYSVKVSPWRRGKGDLLLEVSQAATEFDMDMGVYLSPWDAHSPLYHVDREADYNAYYLAQLKEIL 
SNPNYGNAGKFAEVWMDGARGEGAQKVNYEFEKWFETIRDLQGDCLIFSTEGTSIRWIGNERGYAGDPLWQKVNPDK
LGTEAELNYLQHGDPSGTIFSIGEADVSIRPGWFYHEDQDPKSLEELVEIYFHSVGRGTPLLLNIPPNQAGLFDAKD 
IERLYEFATYRNELYKEDLALGAEVSGPALSADFACRHLTDGLETSSWASDADLPIQLELDLGSPKTFDVIELREDL 
KLGQRIAAFHVQVEVDGVWQEFGSGHTVGYKRLLRGAVVEAQKIRVVITESQALPLLTKISLYKTPGLSKKEVVQEL
AFAEKSLAVAKGENAYFTVKRRECSGPLEAKISIQPGTGVHGVAYQDEIQVLAFQTGETEKSLTLPTLYFAGDKTLD
FYLNLTVDGQLVDQLQVQVSZ

ID208 - 4125.12 

ATGCTTGAAAGACTGAAAAGAATACATTATATGTTTTGGATCAGTTTAATTTTTATGATTTTCCCCATCCTGTCTGT 
AGTGACTGGGTGGCTTTCTGCCTGGCATTTATTGATTGATATTCTATTTGTAGTGGCATATTTGGGTGTTTTAACAA 
CTAAGAGCCAGCGCCTATCTTGGCTATATTGGGGCCTCATGCTGACTTATGTAGTTGGGAATACTGCCTTTGTTGCT 
GTTAATTATATCTGGTTTTTCTTTTTCCTATCCAATCTCTTAAGTTATCATTTCAGCGTACGTAGTTTAAAGTCTTT 
ACATGTCTGGACTTTTCTTCTTGCTCAAGTCCTTGTTGTGGGGCAACTGTTGATTTTTCAGAGAATCGAAGTTGAGT 
TTCTATTCTATCTACTTGTAATTCTTACTTTTGTCGATTTAATGACTTTTGGATTGGTTCGGATTCGTATTGTCGAG
GATTTGAAAGAAGCTCAGGTCAAGCAAAATGCTCAGATAAATCTATTGCTTGCTGAAAATGAACGTAGTCGTATCGG 
TCAGGATTTGCATGATAGTCTGGGACATACCTTTGCTATGCTGAGTGTCAAGACAGATTTAGCCTTGCAGTTATTTC 
AGATGGAGGCTTATCCACAGGTGGAAAAGGAATTAAAAGAAATTCACCAGATCAGCAAGGATCCATGA 



MLERLKRIHYMFWISLIFMIFPILSVVTGWLSAWHLLIDILFVVAYLGVLTTKSQRLSWLYWGLMLTYVVGNTAFVA
VNYIWFFFFLSNLLSYHFSVRSLKSLHVWTFLLAQVLVVGQLLIFQRIEVEFLFYLLVILTFVDLMTFGLVRIRIVE
DLKEAQVKQNAQINLLLAENERSRIGQDLHDSLGHTFAMLSVKTDLALQLFQMEAYPQVEKELKEIHQISKDPZ

ID209 - 4126.3 

ATGAATGATAAGTTAAAAATCTTCTTGTTGCTAGGAGTATTTTTTCTAGCCATAACCGGTTTCTATGTTCTATTGAT 
ACGAAATGCAGGGCAGACAGATGCCTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGGAAAAGCAGTGAAAAAAA
CAGAAATTAGTAAAGACGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGGAGTGGAGGAATATTTC 
TCACGTGTTCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGGTAGAGGAGAAACAACCAAGTACGAATTGAT 
TAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGCCAAGCAAATTTCTCTCAAGGAAATCCTGCTTC 
ACTATTTCCGCATTATCAATCCAACCAGCAAAAATAAACAAGGAAATGATGTGGGGACCCAGTACCGTACTGGTGTT 
TATTACACAGATGACAAGGATTTGGAAGTGATTAACGAAGTCTTTGATGAGGTGGCTAAGAAATACGATCAACCTCT 
AGCAGTTGAAAAGGAAAACTTGAAGAATTTTGTGGTGGCTGAGGATTACCATCAAGACTATCTCAAGAAAAATCCAA
ATGGCTACTGCCATATCAATGTTAATCAGGCGGCCTATCCTGTCATTGATGCCAGCAAATATCCAAAACCAAGTGAT 
GAGGAATTGAAAAAGACCCTGTGACCTGAGGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTCAAA
CCGTTACTGGGATAAATTTGAATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTCATCAAAAGACA 
AATTTGAGTCTGGTTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGTCACCTACAAGGAAGATAAG 
TCCTACAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCACCTTGGGCATGTCTTTACGGATGGTCC 
ACAGGACAAGGGCGGCTTACGTTACTGTATCAATAGCCTCTCTATCCGCTTTATTCCCAAAGACCAAATGGAAGAAA 
AAGGCTACGCTTATTTACTAGATTATGTTGATTAA



MNDKLKIFLLLGVFFLAITGFYVLLIRNAGQTDASQIEKAA 

SRVPGVTDAVSGYANGRGETTKYELINQTGHAETVHVTYDAKQISLKEILLHYFRIINPTSKNKQGNDVGTQYRTGV
YYTDDKDLEVINQVFDEVAKKYDQPLAVEKENLKNFVVAEDYHQDYLKKNPNGYCHINVNQAAYPVIDASKYPKPSD
EELKKTLSPEEYAVTQENQTERAFSNRYWDKFESGIYVDIATGEPLFSSKDKFESGCGWPSFTQPISPDVVTYKEDK
SYNMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLSIRFIPKDQMEEKGYAYLLDYVDZ 

ID210 - 4127.1

ATGAAAAAGAAATGGATGTATTATGCTGCTTGTTCTTCTAATGAATCTGCCGATGACAGTTCATCTGATAAAGGAGA 
CGGCGGTTCGCTAGTCGTTTATTCACCAAACTCAGAGGGCTTAATTGGAGCAACTATTCCTGCCTTTGAAGAAAAAT 
ATGGTATCAAAGTAGAACTGATTCAAGCTGGTACTGGAGAACTTTTCAAAAAACTAGAGTCAGAAAAAGAAGTTCCT
GTAGCTGATGTTATCTTTGGTGGTTCTTATACACAATATACTACCCACGGAGAACTCTTTGAAAACTATACTTCAAA 
AGAAAATGATAATGTTATCAAAGAATATCAAAACACAACTGGCTACTCTACTCCTTATACACTAGATGGTAGTGTTT 
TAATCGTCAACCCTGATTTAACTAAAGGCATGAACATCGAAGGATATAACGATCTTTTCAAACCTGAACTAAAAGGA 
AAAATCGCAACTGCTGACCCAGCAAACTCTTCTAGCGCCTTTGCTCAATTAACAAATATGCTACAAGCTCAAGGTGG 
TTACAAAGATGATAAGGCTTGGTCTTATGTAAAAGATCTTTTCACACTTATTGATGGTAAAATCGGTTCAAGTTCAT
CTAGTGTCTATAAAGTAGTCGCTGATGGAGAAATGGCTGTTGGTCTCTCTTATGAAGATCCAGCAGTTAAACTCTTA 
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AATGACGGAGCTAACATTAAGGTAGTCTATCCAAAAGAAGGAACCGTCTTCCTACCTGCTAGTGCTGCTATCGTTAA 
AAAATCTAAAAATATGGAAAATGCCAAGAAATTTATCGATTTTATTATCTCTCAAGAAGTACAAGATACACTTGGTA 
CAACCACTACTAACCGTCCTGTTCGTAAAAATGCTAAAACAAGCGAAAACATGAAACCAATTGACAAAATCAAAACA 
CTCACTGAAGATTATGATTATGTCATCAAGAATAAATCAGATATCGTTAAGAAATACAACGAAGTCTTTACAGATAT 
CCAATCTAAACAGTAA 



MKKKWMYYAACSSNESADDSSSDKGDGGSLVVYSPNSEGLIGATIPAFEEKYGIKVELIQAGTGELFKKLESEKEVP

VADVIFGGSYTQYTTHGELFENYTSKENDNVIKEYQNTTGYSTPYTLDGSVLIVNPDLTKGMNIEGYNDLFKPELKG 

KIATADPANSSSAFAQLTNWLQAQGGYKDDKAWSYVKDLFTLIDGKIGSSSSSVYKVVAEGEMAVGLSYEDPAVKL^ 

^GANIKWYPKEGTVFLPASAAIVKKSKNMENAKK^ 

LTEDYDYVIKNKSDIVKKYNEVFTDIQSKQZ 

ID211 - 4127.2

ATGAGTGAGATCAAAATTATTAACGCCAAAAAAATCTACCACGATGTCCCTGTTATTGAGAATTTGAACATTACAAT 
TCCAAAAGGAAGTCTCTTTACCCTTCTTGGAGCTTCAGGATGTGGGAAAACGACCCTTCTTCGTATGATTGCAGGTT
TCAACAGTATCGAAGGTGGAGAATTTTACTTCGATGATACAAAAATCAATAATATGGAACCCAGCAAACGCAATATC 
GGGATGGTTTTCCAAAACTACGCTATTTTCCCACATTTGACTGTCCGAGACAACGTTGCTTTTGGTCTTATGCAAAA 
GAAGGTTCCAAAAGAAGAATTGATTCAACAGACCAACAAGTATCTTGAACTCATGCAAATTGCTCAATATGCGGATC 
GAAAGCCCGATAAACTCAGTGGTGGACAACAACAACGTGTCACCTTGGCATGCGCCTTAGCGGTTAATCCAAGTGTT 
CTCCTCATGGACGAGCCACTTAGTAATCTGGAGGCCAAACTTCGCTTGGATATGCGTCAAGCCATCCGAGAAATCCA
ACACGAAGTGGGAATTACAACTGTTTATGTAACCCACGACCAAGAAGAAGCCATGGCTATTTCAGACCAAATTGCTG 
TTATGAAAGATGGGGTGATCCAACAAATCGGCCGACCAAAAGAACTCTATCATAAACCAGCTAATGAGTTTGTGGCA 
ACCTTTATCGGACGCACAAATATTATCCCTGCCAATCTTGAAAAACGGAGCGACGGCGCTTATATCGTCTTTTCAGA 
TGGCTATGCCCTTCGAATGCCAGCTCTTGATCAGGTTGAGGAGCAAGCTATTCATGTAAGCATTCGTCCCGAAGAGT 
TTATCAAAGATGAATCTGGAGATATTGAAGGAACTATTAGAGATAGCGTCTATCTTGGACTAAATACGGATTATTTC 
ATTGAGACAGGTTTTGCCTCAAAAATTCAAGTTAGTGAAGAATCAACTTTTGAAGAAGATCTACAAAAAGGCAATCG 
TATTCGTCTACGAATCAATACGCAAAAATTAAACATCTTTTCTGGAGATGGTTCCGAAAACCTGATAAAAGGAGTCA 
ACCATGGAACGTAA 

MSEIKIINAKKIYHDVPVIENLNITIPKGSLFTLLGASGCGKTTLLRMIAGFNSIEGGEFYFDDTKINNMEPSKRNI
GMVFQNYAIFPHLTVRDNVAFGLMQKKVPKEELIQQTNKYLELMQIAQYADRKPDKLSGGQQQRVTL^
LLMDEPLSNLEAKLRLDMRQAIREIQHEVGITTVYVTHDQEEAMAISDQIAVMKDGVIQQIGRPKELYHKPANEFVA 
TFIGRTNIIPANLEKRSDGAYIVFSDGYALRMPALDQVEEQAIRVSIRPEEFIKDESGDIEGTIRDSVYLGLNTDYF
IETGFASKIQVSEESTFEEDLQKGNRIRLRINTQKLNIFSADGSQNLIKGVNHGTZ 

ID212 - 4136.1 

ATGAAGAAAAAATTATTGGCAGGTGCCATCACACTATTATCAGTAGCAACTTTAGCAGCTTGTTCGAAAGGGTCAGA 
AGGTGCAGACCTTATCAGCATGAAAGGGGATGTCATTACAGAACATCAATTTTATGAGCAAGTGAAAAGCAACCCTT
CAGCCCAACAAGTCTTGTTAAATATGACCATCCAAAAAGTTTTTGAAAAACAATATGGCTCAGAGCTTGATGATAAA 
GAGGTTGATGATACTATTGCCGAAGAAAAAAAACAATATGGCGAAAACTACCAACGTGTCTTGTCACAAGCAGGTAT 
GACTCTTGAAACACGTAAAGCTCAAATTCGTAGAAGTAAATTAGTTGAGTTGGGAGTTAAGAAGGTAGCAGAAGCTG 
AATTGACAGATGAAGCCTATAAGAAAGCCTTTGATGAGTACACTCCAGATGTAACGGCTCAAATCATCCGTCTTAAT 
AATGAAGATAAGGCCAAAGAAGTTCTCGAAAAAGCCAAGGCAGAAGGTGCTGATTTTGCTCAATTAGCCAAAGATAA 
TTCAACTGATGAAAAAACAAAAGAAAATGGTGGAGAAATTACCTTTGATTCTGCTTCAACAGAAGTACCTGAGCAAG 
TCAAAAAAGCCGCTTTCGCTTTAGATGTGGATGGTGTTTCTGATGTGATTACAGCAACTGGCACACAAGCCTACAGT 
AGCCAATATTACATTGTAAAACTCACTAAGAAAACAGAAAAATCATCTAATATTGATGACTACAAAGAAAAATTAAA 
AACTGTTATCTTGACTCAAAAACAAAATGATTCAACATTTGTTCAAAGCATTATCGGAAAAGAATTGCAAGCAGCCA 
ATATCAAGGTTAAGGACCAAGCCTTCCAAAATATCTTTACCCAATATATCGGTGGTGGAGATTCAAGCTCAAGCAGT 
AGTACATCAAACGAATAG

MKKJ^LAGAITLLS^ATIAACSKGSEGAD^ 

EVDOTIAEEKKQYGENYQRVLS(^GMTLETRK^IRTSKLVEUAVKKVAEAEL^ 
NEDKAKEVLEKAKA£GAX3FAQLAKI)NSTDEKTKENGGEITFDSASTEVPE^ 

SQYYIVKLTKKTEKSSNIDDYKEKLKTVILTQKQNDSTFVQSI IGKELQAANIKVKDQAFQMIFTQYIGGGDSSSSS 
STSNEZ

ID213 - 4137.3

ATGAAAAAAAATATTAAACAATATGTAACCTTAGGTACTGTAGTGGTATTATCAGCATTTGTTGCTAACTCAGTTGC
AGCTCAGGAGACTGAAACTTCTGAAGTATCAACACCAAAGTTGGTGCAACCTGTTGCACCAACGACTCCGATTTCGG 
AAGTACAACCTACATCGGATAACTCTTCGGAAGTTACTGTACAACCTCGAACAGTTGAAACTACTGTTAAGGATCCA 
TCTTCTACAGCGGAAGAAACTCCTGTCTTAGAAAAAAATAATGTTACTTTAACAGGGGGCGGAGAAAATGTTACTAA
AGAGTTAAAGGATAAATTTACTAGCGGTGACTTTACTGTAGTGATTAAGTACAATCAGTCAAGTGAGAAAGGCTTAC
AAGCTCTGTTTGGAATATCTAATTCCAAACCCGGTCAACAAAATAGTTATGTAGATGTGTTCCTTAGAGACAATGGT 
GAGTTGGGGATGGAAGCGCGTGATACTTCTTCCAATAAAAATAACCTAGTATCCAGACCTGCTTCAGTTTGGGGTAA 
GTACAAACAAGAGGCTGTGACTAACACTGTTGCAGTAGTAGCAGATTCAGTCAAAAAAACATATTCTTTATACGCAA 

ATGGTACAAAAGTAGTAGAAAAGAAAGTGGATAATTTCCTAAACATCAAGGATATTAAAGGTATTGATTACTATATG

CTTGGGGGAGTGAAACGTGCAGGAAAAACGGCGTTTGGTTTTAACGGAACACTAGAAAATATCAAATTCTTTAATAG 
TGCATTGGATGAAGAAACTGTTAAAAAGATGACAACAAACGCTGTTACTGGACATTTAATTTATACGGCTAATGATA 
CAACAGGTTCTAACTATTTCCGTATTCCAGTTCTGTATACTTTTAGCAATGGTCGGGTATTTTCAAGCATTGACGCT 
CGTTACGGTGGAACTCATGATTTCTTGAATAAAATTAATATTGCTACAAGTTATAGTGATGATAATGGTAAGACATG

GACTAAACCAAAATTAACATTGGCATTCGATGATTTTGCGCCAGTACCATTAGAATGGCCTCGTGAAGTTGGTGGAC
GTGACTTACAAATCAGCGGTGGTGCAACCTATATTGACTCTGTTATTGTTGAAAAAAAGAACAAACAAGTACTCATG
TTTGCTGATGTGATGCCTGCTGGAGTAAGTTTTAGAGAAGCAACTAGAAAAGATTCAGGTTATAAACAAATTGATGG
TAATTATTACCTTAAATTAAGGAAACAAGGTGATACTGATTACAATTATACTATTCGTGAGAATGGTACTGTATACG 
ACGATCGTACCAACAGACCAACTGAATTTTCAGTAGATAAAAATTTCGGTATTAAACAAAATGGTAATTATTTGACG 

GTAGAGCGG

MKKNIKQYVTLGTVWLSAFVANSVAAQETETSEVSTPKLVQ 
SSTAEETPVLEKNlWTLTGGGEm/TKELKDKFTSGDFTWIKTO^ 
ELGMEARDTSSNKNNLVSRPASVWGKYKQEAVTNTVAVVADSVKKTYSLYANGTKVVEKKVDNFLNIKDIKGIDYYM
LGGVKRAGKTAFGFNGTLENIKFFNSALDEETVKKMTTNAVTGHLIYTANDTTGSNYFRIPVLYTFSNGRVFSSIDA
RYGGTHDFLNKINIATSYSDDNGKTWTKPKLTLAFDDFAPVPLEWPREVGGRDLQISGGATYIDSVIVEKKNKQVLM 
FADVMPAGVS FREATRKDSGYKQIDGNYYLKLRKQGDTDYNYTIRENGTVYDDRTNRPTEFSVDKNFGI KQNGNYLT 
VER 

ID214 - 4185

ATGAAAAAATTTAGCCTATTACTAGCTATCCTACCATTTTTGGTTGCCTGTGAGAATCAAGCTACACCCAAAGAGAC 
TAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGACGTGCCACCATTTGACTACGAAGACAAGGGCAATCTGA 
CAGGCTTTGATATCGAAGTTTTAAAGGCAGTAGATGAAAAACTCAGCGACTACGAGATTCAATTCCAAAGAACCGCC 
TGGGAGAGCATCTTCCCAGGACTTGATTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCG 

TGCTGAAAAATACCTTTACTCGCTTCCAATTTCCAACAATCCCCTCGTCCTTGTCAGCAACAAGAAAAATCCTTTGA
CTTCTCTTGACCAGATCGCTGGTAAAACAACACAAGAGGATACCGGAACTTCTAACGCTCAATTCATCAATAACTGG
AATCAGAAACACACTGATAATCCCGCTACAATTAATTTTTCTGGTGAGGATATTGGTAAACGAATCCTAGACCTTGC
TAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGACCGTGGTTTAGACCTCT 
CAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATTTTCTCAAGCGACCAAAAAGAGTTTAAAGAG 

CAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCTTGAAAAACTCAGCAATACCTATCTAGGTGGTTC
TTACCTCCCAGATCAATCTCAGTTACAATAA 

MKKFSLLLAILPFLVACENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTA
WESIFPGLDSGHYQAAANNLSYTKERAEKYLYSLPISNNPLVLVSNKKNPLTSLDQIAGKTTQEDTGTSNAQFINNW

NQKHTDMPATIMFSGEDIGKRILDLANGEFDFLVFDKVSVQKI IKDRGLDLSWDLPSADSPSNYI IFSSDQKEFKE 
QFDKALKELYQDGTLEKLSNTYLGGSYLPDQSQLQZ 









ID215 - 4211.1 



TTTTCAAAACAGTACCAGTGCTAGGCAGGTAAAAACGGAAACCTATACTAATACAGTAACAAATGTCCCTATTGACA 
TACGCTATAATAGTGATAAGTATTTTATTAGCGGTTTTGCTTCAGAAGTATCAGTGGTCTTGACTGGTGCAAATCGC 
CTATCGCTAGCTAGTGAAATGCAAGAAAGTACACGTAAATTCAAGGTTACTGCTGACCTAACAGATGCCGGTGTTGG 
AACGATTGAAGTTCCTTTGAGCATTGAAGATTTACCCAATGGGCTGACCGCTGTGGCGACTCCGCAAAAAATTACAG 
TCAAGATTGGTAAGAAGGCTGAGAAGGATAAGGTAAAGATTGTACCAGAGATTGACCCTAGTCAAATTGATAGTCGG
GTACAAATTGAAAATGTCATGGTGTCAGATAAAGAAGTGTCTATTACGAGTGACCAAGAGACATTGGATAGAATTGA 
TAAGATTATCGCTGTTTTGCCAACTAGCGAACGTATAACAGGTAATTACAGTGGTTCAGTACCTTTGCAGGCAATCG 
ACCGCAATGGTGTTGTCTTACCGGCAGTTATCACTCCGTTTGATACAATAATGAAGGTGACTACAAAACCAGTAGCA 
CCAAGTTCAAGCACATCAAATTCAAGTACAAGCAGTTCATCGGAGACATCTTCGTCAACGAAAGCAACTAGTTCAAA 
AACGAATTAA

MKKNSLYIISSLFFACVLFVYATATNFQNSTSARQVKTETYTNTVTNVPIDIRYNSDKYFISGFASEVSVVLTGANR
LSLASEMQESTRKFKVTADLTDAGVGTIEVPLSIEDLPNGLTAVATPQKITVKIGKKAQKDICVKIVPEIDPSQIDSR 
VQIENVMVSDKEVSITSDQETLDRIDKI IAVLPTSERITGNYSGSVPLQAIDRNGVVLPAVITPFDTIMKVTTKPVA 
PSSSTSNSSTSSSSETSSSTKATSSKTMZ

ID216 - 4127.3

ATGTTGATTGGCGAAGGGTATCGGACTTTCCCTGTCCTGATTTATACCCAATTTATTAGCGAGGTTGGAGGAAATTC 
TGCTTTTGCAATTATGGCGATTATCATTGCCTTGGCAATTTTCCTTATCCAAAAACACATTGCAAACCGCTACAGTT

TCAGCATGAATCTGCTCCATCCAATTGAGCCTAAAAAAACTACAAAAGGAAAAATGGCTGCCATTTATGCAACAGTC

TACGGAATTATCTTTATCTCTGTTTTACCTCAAATCTACTTAATTTATACCTCTTTCCTAAAAACATCAGGTATGGT 
ATCTGTTAAAGGTTATTCTCCAAACAGTTACAAGGTAGCTTTCCATCGTATGGGATCTGCTATTTTCAATACCATTC 
GTATCCCTTTGATTGCCTTAGTTCTAGTTGTTCTATTTGCGACATTTATCTCCTACCTAGCCGTTAGAAAACGGAAT 
TTGTTTACAAACTTAATTGACAGCCTCAGTATGGTACCTTATATTGTACCAGGAACCGTTCTAGGGATTGCCTTCAT 

TTCTTCCTTCAATACTGGTCTATTTGGAAGTGGATTTCTTATGATTACAGGGACTGCTTTCATCTTGATTATGTCTC
TATCTGCCAGAAGATTACCTTATACTATTCGCTCATCTGTTGCTAGCTTACAACAAATAGCACCAAGTATTGAAGAA 
GCTGCTGAAAGCTTAGGAAGTAGTCGTCTCAATACCTTTGCTAAGATTACAACTCCAATGATGCTATCTGGTATCAT 
TTCTGGAGCCATCTTATCTTGA 


MLIGEGYRTFPVLI YTQFISEVGGNSAFAIMAI I IAI^IFLIQKHIANTlYSFSMNLLiHPIEPKKTTKGKMAAIYATV 
YGIIFISVLPQIYLIYTSFLKTSGWSVKGYSPNSYKVAFHRMGSAIFNTIRIPLIALVLVVLFATFISYLAVRKRN 
LFTNLIDSLS^fVPYIVPGTVLGIAFISSFNTGLFGSGFtiMITGTAFILIMSLSARRLlPYTIRSSVASLQQIAPSIEE 
AAESLGSSRLNTFAKITTPMMLSGIISGAILSZ

Table 4
ID301 

ATGAATAAGAAAAAAATGATTTTAACAAGTCTAGCCAGCGTCGCTATCTTAGGGGCTGGTTTTGTTACGTCTCAGCC 
TACTTTTGTAAGAGCAGAAGAATCTCCACAAGTTGTCGAAAAATCTTCATTAGAGAAGAAATATGAGGAAGCAAAAG 
CAAAAGCTGATACTGCCAAGAAAGATTACGAAACGGCTAAAAAGAAAGCAGAAGACGCTCAGAAAAAGTATGAAGAT
GATCAGAAGAGAACTGAGGAGAAAGCTCGAAAAGAAGCAGAAGCATCTCAAAAATTGAATGATGTGGCGCTTGTTGT
TCAAAATGCATATAAAGAGTACCGAGAAGTTCAAAATCAACGTAGTAAATATAAATCTGACGCTGAATATCAGAAAA
AATTAACAGAGGTCGACTCTAAAATAGAGAAGGCTAGGAAAGAGCAACAGGACTTGCAAAATAAATTTAATGAAGTA
AGAGCAGTTGTAGTTCCTGAACCAAATGCGTTGGCTGAGACTAAGAAAAAAGCAGAAGAAGCTAAAGCAGAAGAAAA
AGTAGCTAAGAGAAAATATGATTATGCAACTCTAAAGGTAGCACTAGCGAAGAAAGAAGTAGAGGCTAAGGAACTTG
AAATTGAAAAACTTCAATATGAAATTTCTACTTTGGAACAAGAAGTTGCTACTGCTCAACATCAAGTAGATAATTTG 
AAAAAACTTCTTGCTGGTGCGGATCCTGATGATGGCACAGAAGTTATAGAAGCTAAATTAAAAAAAGGAGAAGCTGA 
GCTAAACGCTAAACAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACTTCTTGACAGCCTTGATCCTGAAG 
GTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAGTTGGATAAAAAAGCTGATGAACTTCAAAATAAA 
GTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACTTGGAGGGGCTGATCCTGAAGATGATACTGCTGC 
TCTTCAAAATAAATTAGCTGCTAAAAAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACTTCTTGACAGCC
TTGATCCTGAAGGTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAGTTGGATAAAAAAGCTGATGAA
CTTCAAAATAAAGTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACTTGGAGGGGCTGATTCTGAAGA
TGATACTGCTGCTCTTCAAAATAAATTAGCTACrAAAAAAGCTGAATTGGAAAAAACTCAAAAAG 
CTCTTAATGAGTTAGGCCCTGATGGAGATGAAGAAGAAACTCCAGCGCCGGCTCCTCAACCAGAGCAACCAGCTCCT 
G CACCAAAAC CAGAG CAAC CAG CT C C AGCTC C AAAACCAG AGCAAC CAGCT CCTGCAC CAAAACC AGAGC AAC CAG C 
TCCAG CTC CAAAACCAG AGCAACCAGCTC CAGCT CCAAAAC CAGAG CAAC CAG CTAAG CCGGAGAAACCAGCTGAAG 
AG C CT ACTCAAC CAGAAAAAC CAG C CACTCCAAAAACAGGCTGGAAACAAG AAAACGGT ATGTGGTATTTCTAC AAT 
ACTGATGGTTCAATGGCAATAGGTTGGCTCCAAAACAACGGTTCATGGTACTACCTAAACGCTAACGGCGCTATGGC 
AACAGGTTGGGTG AAAGATGG AGAT ACCTGGTACTAT CTTG AAG CAT C AGGTGCT ATGAAAGC AAG C C AAT GGTTCA 
AAGTAT CAGATAAATGGT ACT ATGTCAACAGCAATGG CGCTATGGCG AC AGGCTGG CTC C AAT ACAATGGCT CATGG 
TACTACCTCAACGCTAATGGTGATATGGCGACAGGATGGCTCCAATACAACGGTTCATGGTATTACCTCAACGCTAA 
TGGTGAT ATGGCGACAGG ATGGGCTAAAGTCAACGG TT CATGGT ACT ACCT AAACGCT AACGGTG CTATGG CT ACAG 
GTTGGGCTAAAGTCAACGGTTCATGGTACTACCTAAACGCTAACGGTTCAATGGCAACAGGTTGGGTGAAAGATGGA 
G ATAC CTGGTACT AT CTTG AAGCATCAGGTGCT ATG AAAGCAAG CCAATGGT T CAAAGTATCAG AT AAATGGTACT A 

TGTCAATGGCTTAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATAAAGTCAATGCCAATGGTGAATGGGTTT 
AA 

^kkkmiltsiasvailgagfvtsqptfvraeespqw 

dq krt e e karke ae as q klnd v axi wqna yke y revqnq rs icy ks dae yq kklt e vds k i e karkeqq d lqn kfne v 

RAVWPEPNAIAETKKKAEEAJCAEEKVAKRKTO^ 

KKLLAGADPDDGTEVIEAJCLKKGEAELNAKQAEIiAJCKQTELEKLLDSLDPEGKTQDE 
VADLEKEISNLEILI^GADPEDDTAAiQNKIAAia^ 

LQNKVADLEKEISNLEILLGGAIDSEDDTAALQNKIjATKKA£LEKTQKELDAAX<NELGPDGDEEETPAPAPQPEQPAP 

APKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAKPEKPAEEPTQPEKPATPKTGWKQENGMWYFYN 

TDGSMAIGWLQNNGSWYYLNANGAMATGWKDGDTWYYLEA^ 

YYLNANGDMATGWLQYNGSWYYLNANGDMATGWAKVNGSWYYL^^ 

DTWYYLEASGAMKASQWFKVSDKWYYWGLGALAWTTVDGYKVNANGE 

ID302 

ATGTTTGCATCAAAAAG CGAAAGAAAAGTAC ATTATTCAATT CGTAAATTT AGTGT TGG AGT AGCTAGTGTAGTTGT 
TGCCAGTCTTGTTATGGGAAGTGTGGTTCATGCGACAGAGAACGAGGGAGCTACCCAAGTACCCACTTCTTCTAATA 
GGG CAAATG AAAGTCAGGCAGAACAAGGAGAACAAC CTAAAAAACTCGATTC AG AACG AGAT AAG G CAAGG AAAGAG 
GTCGAGGAATATGTAAAAAAAATAGTGGGTGAGAGCTATGCAAAATCAACTAAAAAGCGACATACAATTACTGTAGC 
TCTAG TT AACGAGTTG AACAACATT AAGAACG AGTATTTGAAT AAAATAGTTG AAT CAAC CT CAG AAAG C CAAC T AC 
AGATACTGATGATGGAGAGTCGATCAAAAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGTCA 
AGTTCAGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAACAAGCCGACAGAACCAGGAGAAAAGGT 
AGCAG AAGCT AAG AAGAAGGTTG AAG AAGC TGAGAAAAAAGC CAAGG ATCAAAAAGAAGAAGAT CGTCGT AACT ACC 
CAACCATTACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAAAAAAGCGGAGCTTGAACTA 
GTAAAAGTGAAAGCTAACGAACCTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAAACAAGC 
TG AGG C TACAAGGTTAAAAAAAATCAAG ACAG ATCGTG AAG AAGCAG AAG AAGAAG CT AAACG AAG AGCAG ATGCT A 
AAGAG CAAGGT AAACCAAAGGGG CGGGCAAAACG AGG AGTTC CTGGAGAG CTAGCAAC AC CTGAT AAAAAAG AAAAT 
GATG CGAAGTCTT CAGATTCTAGCGT AGGTG AAG AAACTCTT CCAAG C CC ATCCCTGAAACCAG AAAAAAAGGTAGC 
AGAAGCTG AG AAG AAGGTTG AAG AAGCT AAG AAAAAAG C CGAGGATCAAAAAG AAGAAG AT CGC CGT AACT AC CC AA 
CCAATACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAAAAAAGCGGAGCTTGAACTAGTA
AAAGAGGAAGCTAAGGAACCTCGAAACGAGGAAAAAGTTAAGCAAGCAAAAGCGGAAGTTGAGAGTAAAAAAGCTGA 
GG CT ACAAGGTTAG AAAAAAT C AAGACAG ATCGTAAAAAAGC AG AAG AAG AAGCT AAACG AAAAGCAGC AG AAGAAG 
ATAAAGTTAAAGAAAAACCAGCTGAACAACCACAACCAGCGCCGGCTCCAAAAGCAGAAAAACCAGCTCCAGCTCCA 
AAACCAGAGAATCCAGCTGAACAACGAAAAGCAGAAAAACCAGCTGATCAACAAGCTGAAGAAGACTATGCTCGTAG 
ATCAGAAGAAGAATATAATCGCTTGACTCAACAGCAACCGCCAAAAACTGAAAAACCAGCACAACCATCTACTCCAA 
AAACAGGCTGGAAACAAGAAAACGGTATGTGGTACTTCTACAATACTGATGGTTCAATGGCGACAGGATGGCTCCAA 
AACAATGGCTCATGGTACTACCTCAACAGCAATGGCGCTATGGCGACAGGATGGCTCCAAAACAATGGTTCATGGTA 
CTATCT AAACG CTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTTCATGGTACTACCTAAACGCTAATG 
GTT CAATGG CGACAGGATGGCT C CAATACAATGGC TCATGGT ACT AC CT AAACGCTAATGGTTCAATGGCGACAGG A 
TGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTGATATGGCGACAGGTTGGGTGAAAGATGGAGA 
TACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATG 
TCAATGGCTCAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATGGAGTCAATGCCAATGGTGAATGGGTAAAC 
TAA 

MFASKSERKVHYSIRKFSVGVASVVVASLVTCGSVVHATENEGATQVPTSSNRANESQ^^ 
VEEYVKKIVGESYAKSTKKRHTITVALVNEI^IIOIEYL^ 

SSDSSTKPEASDTAKPNKPTEPGEKVAEAXKKVEEAEKKAKDQKEEDRRNYPTITYKTLELEIAESDVEVKKAELEL 
VKVKANEPRDEQKIKQAEAEVESKQAEATRLKKIKTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGELATPDKKEN 
DAKSSDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRR^PTO^ 
ECEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEK^ 

KPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTGWKQENGMWYFYNTDGSMATGWLQ 

NNGSWYYLNSNGAKATGWLQNNGSWYYI^ANGSHATGWLQNNGSWYY 

WLQYNGSWYYLNANGDMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNGSGALA 

Z 

ID303 

ATGGTAAAAAGACGTATAAGGAGAGGGACGAGAGAACCTGAAAAAGTTGTTGTTCCTGAGCAATCATCTATTCCTTC 

GTATCCTGT ATCTGTTACATCT AAC CAAGG AAC AGATGTAGCAGT AG AAC CAG CTAAAGCAGT TG CTCCAACAACAG 

ACTGGAAACAAGAAAATGGTATGTGGTATTTTTATAATACTGATGGTTCCATGGCAACAGGTTGGGTACAAGTTAAT 

AGTTCATGGTACTACCTCAACAGCAACGGTTCTATGAAAGTCAATCAATGGTTCCAAGTTGGTGGTAAATGGTATT 

TGTAAATACATCGGGTGAGTTAGCGGTCAATACAAGTATAGATGGCTATAGAGTCAATGATAATGGTGAATGGGTGC 

GTTAA 

MVKRRIRRGTREPEKWVPEQSSIPSYPVSVTSNQGTDVAVEPAKAVAPTTD^ 
SSWYYLNSNGSMKVNQWFQVGGKWYYVNTSGELAVNTSIDGYRVNDNGEWVRZ 

ID304

CTGAATACAAGTTTTGTTCATGCTGCTGATGGGATTCAATATGTCAGAGATGATACTAGAGATAAAGAAGAGGGAAT 
AGAGTATGATGACGCTGACAATGGGGATATTATTGTAAAAGTAGCGACTAAACCTAAGGTAGTAACCAAGAAAATTT 
CAAGTACGCGAATTCGTTATGAAAAAGATGAAACAAAAGACCGTAGTGAAAATCCTGTTACAATTGATGGAGAGGAT 
GGCTATGTAACTACGACAAGGACCTACGATGTTAATCCAGAGACTGGTTATGTTACCGAACAGGTTACTGTTGATAG 
AAAAGAAGCCACGGATACAGTTATCAAAGTTCCAGCTAAAAGCAAGGTTGAAGAAGTTCTTGTTCCATTTGCTACTA 
AATATGAAGCAGACAATGACCTTTCTGCAGGACAGGAGCAAGAGATTACTCTAGGAAAGAATGGGAAAACAGTTACA 
ACGAT AAC TTAT AATGT AGATGGAAAG AGTGGAC AAGTAACTG AG AGTACTTTAAGTCAAAAAAAAGACTCT C AAAC 
AAGAGTTGTTAAAAAAAGAACCAAGCCCCAAGTTCTTGTCCAAGAAATTCCAATCGAAACAGAATATCTCGATGGCC 
CAACTCTTGATAAAAGTCAAGAAGTAGAAGAAGTAGGAGAAATTGGTAAATTACTCTTACTACAATCTATACTGTAG 

LNTSFVHAADGIQYVRDDTRDKEEGIEYDDADNGDI IVKVATKPKWTKKISSTRIRYEKDETKDRSENPVTIDGED 
GWTTTRTYDVNPETGYVTEQVTVT)RKEATDWIKVPAKSICV^^ 

TITYNVDGKSGQVTESTLSQKKDSQTRWKKRTKPQVLVQE I PIETE YLDGPTLDKSQEVEEVGEIGKIjLLLQS I LZ 



ID305

ATGAAGCTTTTGAAAAAAATGATGCAAATCGCACTAGCCACATTTTTCTTCGGTTTGTTAGCGACAAATACAGTATT 
TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAATGGTAGAACCTACTACAAAAAGGGGGATCTAAAAGAAA 

cctactggagagtgatagatgggaagtactattattttgatcctttatccggagagatggttgtcggctggcaatat 
atacctgctccacacaagggggttacgattggtccttctccaagaatagagattgctcttagaccagattggtttta 
ttttggtcaagatggtgtattacaagaatttgttggcaagcaagttttaga^ 

aacatcatggggaagaatatgatagccaagcagagaaacgagtctattattttgaagatcagcgtagttatcatact 
TTAAAAACTGGTTGGATTTATGAAGAGGGTCATTGGTATTATTTACAGAAGGATGGTGGCTTTGATTCGCGCATCAA
CAGATTGACGGTTGGAGAGCTAGCACGTGGTTGGGTTAAGGATTACCCTCTTACGTATGATGAAGAGAAGCTAAAAG 
CAGCTCCATGGTACTATCTAAATCCAGCAACTGGCATTATGCAAACAGGTTGGCAATATCTAGGTAATAGATGGTAC 
TACCTCCATTCGTCAGGAGCTATGGCAACTGGCTGGTATAAGGAAGGCTCAACTTGGTACTATCTAGATGCTGAAAA 
TGGTGATATGAGAACTGGCTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTG 
GTTGGTATCAGGAAAGTTCGACTTGGTACTATCTAAATGCAAGTAATGGAGATATGAAAACAGGCTGGTTCCAAGTC 
AATGGTAACTGGTACTATGCCTATGATTCAGGTGCTTTAGCTGTTAATACCACAGTAGGTGGTTACTACTTAAACTA 
TAATGGTGAATGGGTTAAGTAA 

MKLLKKMMQIALATFFFGLl^TirrVFADDSEGWQFVQENGRTYYKKGDLKETYW 

IPAPHKGVTIGPSPRIEIALRPDWFYFGQDGVLQEFVGKQVLEAKTATNTNKHHGEEYDSQAEKRVYYFEDQRSYHT 
LKTGWIYEEGHWYYLQKDGGFDSRrNRLTVGEI^GWVTOYPLTYDEEKXKAAPWYYLNPATGIMQTGWQYLGNRWY 
Yl^SSGAMATGWYKEGSTWYYLDAENGDMRTGWQNLGNKWYYLRSSGAMATGWYQESSTWYYLNASNGDMKTGWFQV 
NGNWYYAYDSGALAVNTTVGGYYLNYNGEWVKZ



ID306 

TTGGCTGGTAGATATGGTTCTGCTGTTCAGTGTACAGAAGTGACTGCCTCAAACCTTTCAACAGTTAAAACTAAAGC 
TACGGTTGTAGAAAAACCACTGAAAGATTTTAGAGCGTCTACGTCTGATCAGTCTGGTTGGGTGGAATCTAATGGTA 
AATGGTATTTCTATGAGTCTGGTGATGTGAAGACAGGTTGGGTGAAAACAGATGGTAAATGGTACTATTTGAATGAC
TTAGGTGTCATGCAGACTGGATTTGTAAAATTTTCTGGTAGCTGGTATTACTTGAGCAATTCAGGTGCTATGTTTAC
AGGCTGGGGAACAGATGGTAGCAGATGGTTCTACTTTGACGGCTCAGGAGCTATGAAGACAGGCTGGTACAAGGAAA 
ATGGCACTTGGTATTACCTTGACGAAGCAGGTATCATGAAGACAGGTTGGTTTAAAGTCGGACCACACTGGTACTAT 
GCCTACGGTTCAGGAGCTTTGGCTGTGAGCACAACAACACCAGATGGTTACCGTGTAAATGGTAATGGTGAATGGGT 
AAACTAG 



IJ\GRYGSAVQCTEVTASNLSTVKTKATWEKPLKDFRASTSDQ 

LGVMQTGFVKFSGSWYYLSNSGAMFTGWGTDGSRWFYFDGSGAMKTGWYKENGTWYYLDEAGIMKTGWFKVGPHWYY 
AYGSGALAVSTTTPDGYRVNGNGEWVNZ



ID307

ATGAAAATTTTGAAAAAAACTATGCAAGTTGGACTGACAGTATTTTTCTTTGGTTTGCTAGGGACCAGTACAGTATT 
TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAACGGAAGAACCTACTACAAAAAGGGGGACCTCAAAGAAA
CCTACTGGCGAGTGATTGATGGTAAGTACTATTATTTTGATTCTCTATCTGGAGAGATGGTTGTCGGCTGGCAATAT 
ATCCCGTTTCCATCTAAAGGTAGTACAATTGGTCCTTACCCAAATGGTATCAGATTAGAAGGTTTTCCAAAGTCAGA 

GTGGTACTACTTCGATAAAAATGGAGTGCTACAAGAGTTTGTTGGTTGGAAAACATTAGAGATTAAAACTAAAGACA
GTGTTGGAAGAAAGTACGGGGAAAAACGTGAAGATTCAGAAGATAAAGAAGAGAAGCGTTATTATACGAACTATTAC 
TTT AATCAAAAT CATTCTTT AGAG ACAGGTTGG CTTT ATG AT CAGTCT AACTGGT ATTATCTAGCT AAGACGG AAAT 
TAAT GG AG AAAACT ACCTTGGTGGTGAAAGACGTG CGGGGTGG AT AAACGATG ATT CGACTTGGTACT ACCT AG ATC 
CAACAACTGGT ATT ATG CAAACAGGTTGGCAAT ATCTAGGT AATAAGTGGT ACT AC CT CCGTT CCT CAGGAG CAATG 

GCCACTGGCTGGTATCAGGAAGGTACCACTTGGTATTATTTAGACCACCCAAATGGCGATATGAAAACAGGTTGGCA
AAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTGGTTGGTATCAAGATGGTTCAACTT 
GGTACTACCTAAATGCAGGTAATGGAGACATGAAGACAGGTTGGTTCCAGGTCAATGGCAACTGGTACTATGCTTAT 
AGCTCAGGTGCTTTGGCAGTGAATACGACCGTAGATGGCTATTCTGTCAACTATAATGGCGAATGGGTTCGGTAA 

MKILKKTMQVGLTVFFFGLLGTSTVFADDSEGWQFVQENGRTYYKKGDLKETYWRVIDGKYYYFDSLSGEMVVGWQY
IPFPSKGSTIGPYPNGIRLEGFPKSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKREDSEDKEEKRYYTNYY 
FNQNHSLETGWLYDQSNWYYIJVKTEINGENYLGGERRAGWiraDSTWYYLDPTTGIMQTGWQYLGNKWYYLRSSGAM 
ATGWYQEGTTWYYLDHPNGDMKTGWQNLGNKWYYLRSSGAMATGWYQDGSTWYYLNAGNGDMKTGWFQVNGNWY 
SSGALAVNTTVDGYSVNYNGEWVRZ 



ID308

ATGAAAAAGAAATTAACTAGTTTAGCACTTGTAGGCGCTTTTTTAGGTTTGTCATGGTATGGGAATGTTCAGGCTCA 
AGAAAGTTCAGGAAATAAAATCCACTTTATCAATGTTC^GAA 

GACATTTTGCCATGGTGGATACAGGAGAAGATTATGATTTCCCAGATGGAAGTGATTCTCGCTATCCATGGAGAGAA 
GGAATTGAAACGTCTTATAAGCATGTTCTAACAGACCGTGTCTTTCGTCGTTTGAAGGAATTGGGTGTCCAAAAACT 
TGATTTTATTTTGGTGACCCATACCCACAGTGATCATATTGGAAATGTTGATGAATTACTGTCTACCTATCCAGTTG 
ACCGAGTCTATCTTAAGAAATATAGTGATAGTCGTATTACTAATTCTGAACGTCTATGGGATAATCTGTATGGCTAT 
GATAAGGTTTTACAGACTGCTGCAGAAAAAGGTGTTTCAGTTATTCAAAATATCACACAAGGGGATGCTCATTTTCA 
GTTTGGGGACATGGATATTCAGCTCTATAATTATGAAAATGAAACTGATTCATCGGGTGAATTAAAGAAAATTTGGG 
ATGACAATTCCAATTCCTTGATTAGCGTGGTGAAAGTCAATGGCAAGAAAATTTACCTTGGGGGCGATTTAGATAAT
GTTCATGGAGCAGAAGACAAGTATGGTCCTCTCATTGGAAAAGTTGATTTGATGAAGTTTAATCATCACCATGATAC 
CAACAAATCAAATACCAAGGATTTCATTAAAAATTTGAGTCCGAGTTTGATTGTTCAAACTTCGGATAGTCTACCTT 
GGAAAAATGGTGTTG ATAGTG AGTATGTTAATTGG CTCAAAGAACG AGG AATTG AG AG AAT C AACG CAGCCAG CAAA 
GACTATGATGCAACAGTTTTTGATATTCGAAAAGACGGTTTTGTCAATATTTCAACATCCTACAAGCCGATTCCAAG 
TTTTCAAGCTGGTTGGCATAAGAGTGCATATGGGAACTGGTGGTATCAAGCGCCTGATTCTACAGGAGAGTATGCTG 
TCGGTTGGAATGAAATCGAAGGTGAATGGTATTACTTTAACCAAACGGGTATCTTGTTACAGAATCAATGGAAAAAA 
TGGAACAATCATTGGTTCTATTTGACAGACTCTGGTGCTTCTGCTAAAAATTGGAAGAAAATCGCTGGAATCTGGTA 
TTATTTTAACAAAGAAAACCAGATGGAAATTGGTTGGATTCAAGATAAAGAGCAGTGGTATTATTTGGATGTTGATG 
GTTCTATGAAGACAGGATGGCTTCAATATATGGGGCAATGGTATTACTTTGCTCCATCAGGGGAAATGAAAATGGGC 
TGGGTAAAAGATAAAGAAACCTGGTACTATATGGATTCTACTGGTGTCATGAAGACAGGTGAGATAGAAGTTGCTGG 
TCAACATT ATT AT CTGGAAG ATTCAGG AG CT ATG AAG C AAGG CTGGCAT AAAAAGG CAAATGATTGGT AT TT CTACA 
AGACAGACGGTTCACGAGCTGTGGGTTGGATCAAGGACAAGGATAAATGGTACTTCTTGAAAGAAAATGGTCAATTA 
CTTGTGAACGGTAAGACACCAGAAGGTTATACTGTGGATTCAAGTGGTGCCTGGTTAGTGGATGTTTCGATCGAGAA 
AT CTG CT ACAATT AAAAC TAC AAGTCATTCAG AAAT AAAAG AAT CC AAAG AAGT AGTGAAAAAGG ATCTTGAAAAT A 
AAGAAACGAGTCAACATGAAAGTGTTACAAATTTTTCAACTAGTCAAGATTTGACATCCTCAACTTCACAAAGCTCT 
GAAACGAGTGTAAACAAATCGGAATCAGAACAGTAG 

MKKK^TSU^VGAFLGLSWYGNVQAQESSGNKIHFINVQEGGSDAIILESNGHFAMVDTGEDYDFPDGSDSRYPWRE 
GIETSYKHVLTDRVFRRLKELGVQKLDFILVTHTHSDHIGMVDELLSTYPVDRVYLKKYSDSRITNSERLWDNLYGY 
DKVLQTAAEKGVSVIQNITQGDAHFQFGDMDIQLYNYENETDSSGELKKIWDDNSNSLISVVKVNGKKIYLGGDLDN 
VHGAEDKYGPLIGKVTlLMKFNHHHDTNKSNTKE^ 

DYDATVFD IRKDGFVNI STS YKP I PS FQAGWHKSAYGNWWYQAPDSTGE YAVGWNE I EGEWYYFNQTGI LLQNQWKK 
WNNHWFYLTDSGASAKNWKKIAGIWYY FNKENQMEIGWIQDKEQWYYLDVDGSMKTGWLQYMGQWYYFAP SGEMKMG 
VA/KDKETWYYT^STGVMKTGEIEVAGQHYYLEDSGAMKQGWHKKANDW 

LVNGKTPEGYTVDSSGAWLVDVSIEKSATIKTTSHSErKESKEVVKKDLENKETSQHESVTNFSTSQDLTSSTSQSS 
ETSVNKSESEQZ 

ID309 

ATGGAAATTAATGTGAGTAAATTAAGAACAGATTTGCCT.CAAGTCGGCGTGCAACCATATAGGCAAGTACACGCACA 
CTCAACTGGGAATCCGCATTCAACCGTACAGAATGAAGCGGATTATCACTGGCGGAAAGACCCAGAATTAGGTTTTT 
TCTCGCACATTGTTGGGAACGGTTGCATCATGCAGGTAGGACCTGTTGATAATGGTGCCTGGGACGTTGGGGGCGGT 
TGGAATGCTGAGACCTATGCAGCGGTTGAACTGATTGAAAGCCATTCAACCAAAGAAGAGTTCATGACGGACTACCG 
C CTTTAT AT C G AACTCTTACG CAAT CT AG CAGATG AAGC AGGTTTG C CG AAAACG CTTG AT AC AGGGAGTTT AGCTG 
GAATTAAAACGCACGAGTATTGCACGAATAACCAACCAAACAACCACTCAGACCACGTTGACCCTTATCCATATCTT 
G CT AAATGGGGCATT AG CCGTG AG C AGTT T AAGC ATG AT ATTGAG AACGG CTTGACG ATTG AAACAGGCTGG C AG AA 
GAATG ACACTGG CT ACTGGTACGT AC ATT CAGACGG CT CTT ATCCAAAAGACAAGTTTG AGAAAATCAATGGCACTT 
GGTACTACTTTGACAGTTCAGGCTATATGCTTGCAGACCGCTGGAGGAAGCACACAGACGGCAACTGGTACTGGTTC 
GAGAACTCAGGCGAAATGGCTAGAGGCTGGAAGAAAATCGCTGATAAGTGGTACTATTTGAACGAAGAAGGTGCCAT 
GAAGACAGGCTGGGTCAAGTACAAGGACACTTGGTACTACTTAGACGCTAAAGAAGGCGCCATGGTATCAAATGCCT 
TTAT C C AGTCAG CGG ACGGAAC AGGC TGGT ACT AC CT C AAACCAGACGG AACACTGG CAGAC AAG CCAGAATTCACA 
GTAGAGCCAGATGGCTTGATTACAGTAAAATAA 

MEI^SKXRTDLPQVGVQPYRQVHAHSTGNPHSWQNEADYHWRKDPELGFFSH^ 

WNAETYAAVELIESHSTKEEFMTDYRLYIELLRNIADEAGLPKTLDTGSLAGIKTHEYCTNNQPNNHSDHVDPYPYL 
AKWGISREQFKHDIENGLriETGWQKNDTGTOYVHSDGSYPKDKFEKIN^ 

DNSGEI^TGWKKIADKWYYFNEEGAMKTGWVKYKDTWYYLDAKEGA^rVSNAFIQSADGTGWYYLKPDGTLADKPEFT 
VEPDGLITVKZ 

ID310 

ATGGGCACAACAGGATTTACAATAATTGACTTAATTATCTTGATTGTTTATTTACTTGCGGTGTTGGTTGCAGGTAT 
CTATTTCTCTAAAAAAGAGATGAAAGGAAAAGAGTTCTTTAAAGGAGATGGTTCGGTTCCTTGGTATGTTACTTCGG 
TATCCATTTTTGCCACAATGCTCAGTCCGATTTCCTTCTTGGGACTCGCTGGTAGCTCTTATGCAGGTAGCTGGATT 
TTATGGTTTGCTCAATTAGGGATGGTAGTAGCTATTCCACTGACAATTCGTTTTATCTTACCTATCTTTGCACGGAT 
AGACATCGATACGGCATATGATTACTTGGATAAACGTTTTAATTCTAAAGCACTTCGTATTATTTCAGCACTCTTGT 
TTATTATTTATCAATTGGGACGTATGTCTATCATTATGTACCTCCCATCAGCTGGTTTATCAGTATTGACAGGAATT 
GACATCAATATTTTGATTATTTTGATGGGTGTAGTTGCAATTGTTTATTCTTATACTGGTGGTCTAAAATCCGTATT 
ATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCTTTATTTGTACTGATTGCTAATATTA 
AAGGTGGCTTTGGTGCAGTAGCAGAAACATTAGCAAACGGGAAATTCCTTGCTGCAAATGAAAAACTTTTCGATCCT 
AACTTGCTTTCAAACTCCATCTTTTTAATTGTGATGGGTTCAGGCTTTACAATCTTGTCTTCCTATGCTTCATCTCA 
AGATTTGGTTCAACGTTTTACTACAACACAAAATATTAAGAAACTTAATAAGATGTTGTTCACAAACGGTGTTTTGT
CACTTGCAACTGCAACAGTCTTTTACTTGATTGGTACAGGCTTGTACGTATTCTATCAAGTACAAAATGCAGATAGT 
GCAGCTAGCAATATCCCTCAAGACCAAATCTTTATGTACTTTATTGCATACCAGTTACCAGTAGGTATCACAGGTTT 
GATCTTGGCAGCGATTTATGCAGCATCTCAATCAACTATTTCAACAGGTTTGAACTCTGTTGCAACTTCATGGACAT 
TGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAATCGTCGTACGAAAATTGCACAATTCGTATCTCTAGCA 
GTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCTCACTCAGATATTAAATCTGCATACGAATGGTTCAATAG 
rTTCATGGGACTTGTACTTGGTCTACTTGGTGGTGTATTTATTCTTGGATTTGTTTCTAAAAAAGCAAATAAACAAG 
GTGCTTATGCAGCGCTGATTGTATCAACCATCGTCATGGTATTTATTAAATACTTCCTTCCTCCAACAGCTGTTAGC 
TACTGGGCATATTCATTGATTT CAATCTCTGT AT CAGT AGTTTCAGGTTATATTGT ATCTGTT CTTACTGGAAATAA 
AGT AT CTGC AC CT AAAT ATACAACGATTCATG ATATT AC AG AAAT TAAAGCGG ATTCAAGTTG GGAAGTTCGTCACT 
AA 



MGTTGFT 1 1 DLI IL I VYLLAVLVAGI YFSKKEMKGKE FFKGDGSVPWYVTSVS I FATMLS PIS FLGLAGSS YAGSW I 
LWFAQLGMWAI PLTI RFI LP I FAR I DI DTAYD YLDKRFNS KALR 1 1 SALLF 1 1 YQLGRMS 1 1 MYL PSAGLSVLTG I 
DINILIILMGVVAIVYSYTGGLKSVLWTDFIQGVILISGVVLAL
NLLSNSIFLIVMGSGFTILSSYASSQDLVQRFTTTQNIKKXNK^^ 

AASN I PQDQ I FMYF I AYQL PVGI TGL I LAA I YAASQST I STGLNS VATSWTLD IQDV IS KNMS DMRRTKI AQ FVS LA 
VGLFSIGVSIVMAHSDIKSAYEWFNS FMGLVLGLLGGVFILGFVSKKANKQGAYAALIVSTIVMVFIKYFLPPTAVS 
YWAYS LI S I S VS WSG YI VSVLTGNKVS AP KYTT I HD I TE I KADS SWE VRHZ 

ID311 

ATGAAAATTAATAAAAAATATCTAGCAGGTTCAGTGGCAGTCCTTGCCCTAAGTGTTTGTTCCTATGAGCTTGGTCG 
TCACCAAGCTGGTCAGGATAAGAAAGAGTCTAATCGAGTTGCTTATATAGATGGTGATCAGGCTGGTCAAAAGGCAG 
AAAACTTGACACC AG ATGAAGTCAGTAAGAGGG AGGGG ATCAACG CCG AACAAATCGT CAT C AAGATT ACGG ATCAA 
GGTT ATGTGAC CT CTCATGGAG AC C ATT AT C ATTACTATAATGG C AAGGT CCCT TATG ATG C CAT CATCAGTG AAG A 
GCTCCTCATGAAAGATCCGAATTATCAGTTGAAGGATTCAGACATTGTCAATGAAATCAAGGGTGGTTATGTCATCA 
AGGTAGACGGAAAATACTATGTTTACCTTAAGGATGCAGCTCATGCGGATAATATTCGGACAAAAGAAGAGATTAAA 
CGT CAG AAG CAGGAACGCAGT CATAATCACGGGTCAGG AGC TAACG ATCATGCAGTAGCTG C AG C CAGAG CC C AAGG 
ACGCTATACAACGGATGATGGGTATATCTTCAATGCATCTGATATCATTGAGGACACGGGTGATGCTTATATCGTTC 
CTCACGGCGACCATTACCATTACATTCCTAAGAATGAGTTATCAGCTAGCGAGTTAGCTGCTGCAGAAGCCTATTGG 
AATGGGAAGCAGGG AT CT CGTCCTTCTTCAAGTTCTAGTTATAATGCAAAT CCAGCTCAAC CAAGATTGTCAG AGAA 
CCACAATCTGACTGTCACTCCAACTTATCATCAAAATCAAGGGGAAAACATTTCAAGCCTTTTACGTGAATTGTATG 
CTAAACCCTTATCAGAACGCCATGTGGAATCTGATGGCCTTATTTTCGACCCAGCGCAAATCACAAGTCGAACCGCC 
AGAGGTGTAGCTGTCCCTCATGGTAACCATTACCACTTTATCCCTTATGAACAAATGTCTGAATTGGAAAAACGAAT 
TGCTCGTATTATTCCCCTTCGTTATCGTTCAAACCATTGGGTACCAGATTCAAGACCAGAACAACCAAGTCCACAAT 
CGACTCCGGAACCTAGTCCAAGTCCGCAACCTGCACCAAATCCTCAACCAGCTCCAAGCAATCCAATTGATGAGAAA 
TTGGTCAAAGAAGCTGTTCGAAAAGTAGGCGATGGTTATGTCTTTGAGGAGAATGGAGTTTCTCGTTATATCCCAGC 
CAAGG ATCTTTCAGCAGAAAC AG CAGCAGGCATTG AT AGCAAACTGGCCAAG C AGGAAAGTTTATCT C AT AAGC TAG 
GAGCTAAGAAAACTGACCTCCCATCTAGTGATCGAGAATTTTACAATAAGGCTTATGACTTACTAGCAAGAATTCAC 
CAAGATTTACTTGATAATAAAGGTCGACAAGTTGATTTTGAGGCTTTGGATAACCTGTTGGAACGACTCAAGGATGT 
CCCAAGTGATAAAGTCAAGTTAGTGGATGATATTCTTGCCTTCTTAGCTCCGATTCGTCATCCAGAACGTTTAGGAA 
AACCAAATGCGCAAATTACCTACACTGATGATGAGATTCAAGTAGCCAAGTTGGCAGGCAAGTACACAACAGAAGAC 
GGTTATATCTTTGATCCTCGTGATATAACCAGTGATGAGGGGGATGCCTATGTAACTCCACATATGACCCATAGCCA 
CTGGATTAAAAAAGATAGTTTGTCTGAAGCTGAGAGAGCGGCAGCCCAGGCTTATGCTAAAGAGAAAGGTTTGACCC 
CT CCTTCGACAGACCAT CAGGATTCAGG AAAT ACTGAGGC AAAAGGAG CAGAAGCTAT CTACAA C CG CGTG AAAGCA 
G CTAAG AAGGTG CCACTTGATCGTATGCCTTAC AATC TTCAATAT ACTGTAGAAGTCAAAAACGGTAGTTT AAT CAT 
ACCTCATTATGACCATTACCATAACATCAAATTTGAGTGGTTTGACGAAGGCCTTTATGAGGCACCTAAGGGGTATA 
CTCTTGAGGATCTTTTGGCGACTGTCAAGTACTATGTCGAACATCCAAACGAACGTCCGCATTCAGATAATGGTTTT 
GGTAACGCT AGCGAC CATG TTCAAAG AAACAAAAATGGT C AAG CTG AT AC C AATCAAACGG AAAAACC AAG CG AGG A 
G AAACCTCAGACAGAAAAACCTGAGG AAG AAACC CCT CG AG AAG AGAAAC CG CAAAG CG AGAAACC AG AGT CTC CAA 
AACCAACAGAGGAACCAGAAGAATCACCAGAGGAATCAGAAGAACCTCAGGTCGAGACTGAAAAGGTTGAAGAAAAA 
CTGAGAGAGGCTG AAGATT T ACTTGG AAAAATC C AGG ATCCAATTATC AAGT C CAATGCC AAAGAG ACTCTCACAGG 

ATTAAAAAATAATTTACTATTTGGCACCCAGGACAACAATACTATTATGGCAGAAGCTGAAAAACTATTGGCTTTAT 
TAAAGGAGAGTAAGTAA

MKINKKYLAGSVAVLALSVCSYELGRHQAGQDKKESNRVAYIDGDQAGQFCAENLTPDEVSKREGINAEQIVIKITDQ 
GYVTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQLI^SDira^ 

RQKQERSHNHGSGANDHAVAAARAQGRYTTDDG YI FNAS D I r EDTGDAY I VPHGDHYH YI P fCNELSAS ELAAAEAYW 
NGKQGSRPSSSSSYNANPAQPRLS ENHMLTVTPTYHQNQGEN ISSLLRELYAKPLSERHVESDGLI FDPAQ ITSRTA 
RGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPIDEK
LVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDS
QDLLDNKGRQVDFEALDNLLERLKDVPSDKVKLVDDILAFI^ 

GYIFDPRDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTE1AKGAEAI YNRVKA 
AKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNI KFEWFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGF 
GNASDHVQRNKNGQADTNQTEKPSEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEESPEESEEPQVETEKVEEK 

LREAEDLLGKIQDPIIKSNAKETLTGLKNNLLFGTQDNNTIMAEAEKLLALLKESKZ

ID312

ATGGAGGGATTGGTTAGAGTGCATTTATTGCCTGTATTTGGCGATTACAAGCTATCTAAACTTACTACGCCTATTCT 
T C AAC AGCAAGTAAACAAATGGG CTG AC AAGG CAAAT AAAGGCGAAAAAGGGGCATTTGCT AACTACTCTTTGCT C C 
ATAACATGAATAAGCGTATTTTGAAATATGGCGTAGCTATCCAGGTAATACAATACAACCCAGCTAATGATGTCATC 

GTTC CACGCAAACAGC AAAAAG AAAAGG CTG CTGT CAAAT ACT TAG AC AAC AAAGAATTAAAACAG TTT CTTG ATTA 
TTTAGATGCTCTGGATCAATCAAATTATGAGAACTTATTTGATGTTGTTCTGTATAAGACTTTATTGGCCACTGGTT 

G CCGT ATTAGTG AGGCTCTGG CT CTTGAATGG T CTG AT AT TG ACCT AG AAAG CGGTGTT ATCAG CATCAATAAG AC A 
CTAAACCGCTATCAGGAAATAAACTCACCTAAATCAAGCGCTGGTTATCGTGATATACCAATAGACAAAGCCACATT 
ACTTTTACTGAAACAATACAAAAACCGTCAACAAATTCAGTCTTGGAAATTAGGCCGATCTGAAACAGTTGTATTCT 

CTGT ATTTACGG AGAAATATGCTTATG CTTGTAACTT ACG CAAACGC C T AAAT AAGC ATTT TGATG CTG CT GG AGT A 
ACTAACGTATCATTTCATGGTTTCCGCCATACACATACTACTATGATGCTCTATGCTCAGGTTAGCCCGAAAGATGT 
TCAGTATAGATTAGGCCACTCTAATTTAATGATCACTGAAAATACTTACTGGCATACTAACCAAGAGAATGCAAAAA 

AAGCCGTCTCAAATTATGAAACAGCTATCAACAATTTATAA 

MEGLVRVHLLPVFGDYKLSKLTTPrLQQQVNKWADKANra 
VPRKQQKEICAAVKYLDNKELKQFLDYLDAIJDQSNYENLFDW 
L^YQEINS PICSSAGYRDIPIDKATLLLLKQYKNRQQIQSWKLG 

TITVSFHGFRHTHTTmLYAQVSPKDVQYRLGHSNLMITENTYWHTNQENAKKAVSNYETAINNLZ 






WO 00/06737 PCT/GB99/02451 


CLAIMS: 

1. A Streptococcus pneumoniae protein or polypeptide having a sequence
selected from those shown in Table 2.


2. A Streptococcus pneumoniae protein or polypeptide having a sequence 
selected from those shown in table 4. 

3. A protein or polypeptide as claimed in claim 1 or claim 2 provided in
substantially pure form.

4. A protein or polypeptide which is substantially identical to one defined in any 
one of claims 1 to 3. 

5. A homologue or derivative of a protein or polypeptide as defined in any one of
claims 1 to 4. 

6. An antigenic and/or immunogenic fragment of a protein or polypeptide as
defined in Tables 2-4.

7. A nucleic acid molecule comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 1 or their RNA equivalents; 


(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those
sequences of (i) or (ii);

(iv) a sequence which is substantially identical with any of those of (i), (ii)
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 1.

8. A nucleic acid molecule comprising or consisting of a sequence which is: 



(i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is substantially identical with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 4. 



9. The use of a protein or polypeptide having a sequence selected from those 
shown in Tables 2-4, or homologues, derivatives and/or fragments thereof, as an 
immunogen and/or antigen. 

10. An immunogenic and/or antigenic composition comprising one or more 
proteins or polypeptides selected from those whose sequences are shown in Tables 2-4,
or homologues or derivatives thereof, and/or fragments of any of these.

11. An immunogenic and/or antigenic composition as claimed in claim 10 which 
is a vaccine or is for use in a diagnostic assay. 


12. A vaccine as claimed in claim 11 which comprises one or more additional 
components selected from excipients, diluents, adjuvants or the like. 

13. A vaccine composition comprising one or more nucleic acid sequences as
defined in Tables 1, 3 or 4.

14. A method for the detection/diagnosis of S. pneumoniae which comprises the
step of bringing into contact a sample to be tested with at least one protein or 
polypeptide as defined in Tables 2-4, or homologue, derivative or fragment thereof. 


15. An antibody capable of binding to a protein or polypeptide as defined in
Tables 2-4, or to a homologue, derivative or fragment thereof.

16. An antibody as defined in claim 15 which is a monoclonal antibody. 


17. A method for the detection/diagnosis of S. pneumoniae which comprises the step
of bringing into contact a sample to be tested and at least one antibody as defined in
claim 15 or claim 16.

18. A method for the detection/diagnosis of S. pneumoniae which comprises the
step of bringing into contact a sample to be tested with at least one nucleic acid
sequence as defined in claim 7 or claim 8.

19. A method of determining whether a protein or polypeptide as defined in
Tables 2-4 represents a potential anti-microbial target which comprises inactivating
said protein or polypeptide and determining whether S. pneumoniae is still viable in 
vitro or in vivo. 

20. The use of an agent capable of antagonising, inhibiting or otherwise 
interfering with the function or expression of a protein or polypeptide as defined in 
Tables 2-4 in the manufacture of a medicament for use in the treatment or 
prophylaxis of S. pneumoniae infection.